Submitted to the Annals of Probability 
arXiv: 0806.2224 [math.PR]



TREE-VALUED RESAMPLING DYNAMICS
(MARTINGALE PROBLEMS AND APPLICATIONS)

The measure-valued Fleming-Viot process is a diffusion which
models the evolution of allele frequencies in a multi-type population. 
In the neutral setting the Kingman coalescent is known to generate 
the genealogies of the "individuals" in the population at a fixed time. 
The goal of the present paper is to replace this static point of view 
on the genealogies by an analysis of the evolution of genealogies. 

Ultra-metric spaces extend the class of discrete trees with edge
length by allowing behavior such as infinitesimally short edges. We
encode genealogies of the population at fixed times as elements in the
space of (isometry classes of) ultra-metric measure spaces. The
additional probability measure on the ultra-metric space allows us to
sample from the population. We equip this state space with the
Gromov-weak topology and use well-posed martingale problems to
construct tree-valued resampling dynamics for both the finite
population (tree-valued Moran dynamics) and the infinite population
(tree-valued Fleming-Viot dynamics).

As an application we study the evolution of the distribution of the 
lengths of the sub-trees spanned by sequentially sampled "individu- 
als". We show that ultra-metric measure spaces are uniquely deter- 
mined by the distribution of the infinite vector of the subsequently 
evaluated lengths of sub-trees. 



1. Introduction. In the present paper we construct and study the
evolution of the genealogical structure of the neutral multi-type
population model called the Fleming-Viot process ([FV78, FV79, Daw93,
EK93, DGV95, Eth01]). This process arises as the large population limit
of Moran models. The Moran model (MM) can be described as follows:
consider a population of fixed size with finitely many individuals which
carry (genetic) types. Each pair resamples at constant rate. Resampling
of a pair means that the pair dies and is replaced by two new individuals
which are considered as the offspring of one individual of the pair,
chosen at random.
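
The resampling mechanism just described can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `simulate_moran`, the per-pair rate parameter `gamma` and the event-by-event (Gillespie-style) simulation are choices made here, not part of the paper.

```python
import random


def simulate_moran(types, gamma, t_end, rng):
    """Simulate a neutral Moran model up to time t_end (illustrative sketch).

    Assumption: each unordered pair resamples at rate gamma. At a
    resampling event one member of the pair, chosen uniformly, is the
    parent and its type overwrites the type of the other member.
    """
    types = list(types)
    n = len(types)
    pair_count = n * (n - 1) // 2
    t = 0.0
    while pair_count > 0:
        # Waiting time until the next resampling event among all pairs.
        t += rng.expovariate(gamma * pair_count)
        if t > t_end:
            break
        # Two distinct indices in random order: first is parent, second child.
        parent, child = rng.sample(range(n), 2)
        types[child] = types[parent]
    return types
```

The population size stays fixed and no new types appear, which is the neutral setting without mutation considered in the text.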

In stochastic population models genealogical trees are frequently used.
This applies to both branching and resampling models. In branching
models, genealogical trees arise, for example, as the Kallenberg tree and
the Yule tree ([Kal77, EQ94]). More examples include continuum (mass)
limits in branching models such as the Brownian continuum random tree
(CRT) or the Brownian snake ([Ald91, LG99]). More general branching
mechanisms lead to more general genealogies such as Lévy trees
([DLG02]), which are the infinite variance offspring distribution
counterpart of the Brownian CRTs, or trees arising in catalytic branching
systems ([GPW09b]), to name just a few. In resampling models, which are
used for populations with a fixed size (or total mass), genealogical
trees can be generated by coalescent processes. A moment duality leads to
the connection of the Fleming-Viot process to the Kingman coalescent tree
([Kin82a, Ald93, Eva00, GLW05]). More general Fleming-Viot processes
which allow for an infinite offspring variance are studied in [BG03].
Their genealogical relationships are described by Λ-coalescents
([Pit99, GPW09a]).

Notice that coalescent trees describe the genealogy of a population at a 
fixed time and give therefore a static picture only. The main goal of the 
present paper is to construct a dynamic picture. That is, we obtain the tree- 
valued Moran dynamics and the tree-valued Fleming-Viot dynamics model- 
ing the evolution of genealogies (in forward time). We use techniques from 
the theory of martingale problems. 

In the construction of the tree-valued resampling dynamics we stick to 
the simplest case of a neutral population model, i.e., neither mutation, nor 
selection and recombination are built into the model. Due to discreteness, 
the tree-valued Moran dynamics can be constructed by standard theory on 
piece-wise deterministic jump processes. However, the construction of the 
tree-valued Fleming-Viot dynamics requires more thought.

For coding genealogical trees and defining convergence, we will rely on
the fact that genealogical distances between individuals define a metric.
Note that genealogical distances between individuals which belong to the
population at a fixed time satisfy the ultra-metricity property (see also
[BK06, MPV87] for the connection of ultra-metric spaces and tree-like
structures in spin-glass theory). To take the individuals' contribution
to the population into account we further equip the resulting
ultra-metric space with a probability measure. We then follow the theory
of (ultra-)metric measure spaces equipped with the Gromov-weak topology
as developed in [GPW09a]. Convergence in the Gromov-weak topology means
convergence of all finite sub-spaces sampled with respect to the
corresponding probability measures. The space of ultra-metric measure
trees complements the space of real trees (see [Dre84, DMT96, Ter97] and
Remark 2.2) equipped with the Gromov-Hausdorff topology ([Gro99, BBI01]).
Evolving genealogies in exchangeable population models - including the
models under consideration - have been described by look-down processes
([DK96, DK99b, DK99a]). Look-down processes contain all information
available in the model. In particular, they contain - though in an
implicit way - all information about the trees. Since the crucial point
in the construction of look-down processes is the use of labels as
coordinates, they encode information which is often not needed. A first
approach in the direction of a coordinate-free description has already
been taken in spatial settings via historical processes in the context of
branching ([DP91]) and resampling models ([GLW05]).

Markov dynamics with values in the space of "infinite" or continuum trees
have been constructed only recently. These include excursion path valued
Markov processes with continuous sample paths - which can therefore be
thought of as tree-valued diffusions - as investigated in [Zam01, Zam02,
Zam03], and dynamics working with real trees, for example, the so-called
root growth with re-grafting (RGRG) ([EPW06]), the so-called subtree
prune and re-graft move (SPR) ([EW06]) and the limit random mapping
([EL08]). While the RGRG has a projective property allowing for an
explicit construction of the Feller semi-group as the limit semi-group of
"finite" tree-valued dynamics arising in an algorithm for constructing
uniform spanning trees, the SPR and the limit random mapping were
constructed as candidates of the limit of "finite" tree-valued dynamics
using Dirichlet forms. Unfortunately, Dirichlet forms are often
inadequate for proving convergence theorems. Hence in order to be able to
show that indeed the tree-valued Moran dynamics converge to the
tree-valued Fleming-Viot dynamics, we use the characterization of the
tree-valued dynamics as solutions of well-posed martingale problems. This
is the first example in the literature where tree-valued Markov dynamics
are constructed as the solution of a well-posed martingale problem.

An important consequence of the construction of the tree-valued
resampling dynamics as solutions of well-posed martingale problems is
that it allows one to study the evolution of continuous functionals of
these processes and to characterize which functionals form strong Markov
processes. As an example we describe the evolution of the distribution of
tree-lengths of subtrees spanned by sequentially sampled "individuals".
A related functional, the depth of the tree, has been studied earlier in
[PW06] using the look-down construction of [DK99a]. Since the depth
measures the time to the most recent common ancestor of all "individuals"
of the population, it is not continuous with respect to the Gromov-weak
topology. It is shown in [PW06] that the resulting process is not
Markovian.

2. Main results (Theorems 1, 2 and 3). In this section we state our
main results. In Subsection 2.1 we recall concepts and terminology used
to define the state space which consists of (ultra-)metric measure spaces
and which we equip with the Gromov-weak topology. In Subsection 2.2 we
state the tree-valued Fleming-Viot martingale problem and its
well-posedness, and present the approximation by tree-valued particle
dynamics in Subsection 2.3. In Subsection 2.4 we identify a unique
equilibrium and state that it will be approached as time tends to
infinity. In Subsection 2.5 we discuss possible extensions of the model.

2.1. State space: metric measure spaces. In [GPW09a] topological
aspects of the space of metric measure spaces equipped with the
Gromov-weak topology are investigated. We will use this space as the
state space for the evolution of the genealogies of our models. In this
subsection we recall basic facts and notation.

As usual, given a topological space $(X, \mathcal{O})$ we denote by
$\mathcal{M}_1(X)$ ($\mathcal{M}_f(X)$) the space of all probability
(finite) measures, defined on the Borel-$\sigma$-algebra of $X$, and by
$\Rightarrow$ weak convergence in $\mathcal{M}_1(X)$. Recall that the
support of $\mu$, $\mathrm{supp}(\mu)$, is the smallest closed set
$X_0 \subseteq X$ such that $\mu(X_0) = 1$. The push forward of $\mu$
under a measurable map $\varphi$ from $X$ into another metric space
$(Z, r_Z)$ is the probability measure $\varphi_*\mu \in \mathcal{M}_1(Z)$
defined by

(2.1) $\varphi_*\mu(A) := \mu(\varphi^{-1}(A))$,

for all Borel subsets $A \subseteq Z$. We denote by $\mathcal{B}(X)$ and
$\mathcal{C}_b(X)$ the bounded real-valued functions on $X$ which are
measurable and continuous, respectively.

A metric measure space is a triple $(X, r, \mu)$ where $(X, r)$ is a
metric space such that $(\mathrm{supp}(\mu), r)$ is complete and
separable and $\mu \in \mathcal{M}_1(X)$ is a probability measure on
$(X, r)$. Two metric measure spaces $(X, r, \mu)$ and $(X', r', \mu')$
are measure-preserving isometric or equivalent if there exists an
isometry $\varphi$ between $\mathrm{supp}(\mu)$ and $\mathrm{supp}(\mu')$
such that $\mu' = \varphi_*\mu$. It is clear that the property of being
measure-preserving isometric is an equivalence relation. We write
$\overline{(X, r, \mu)}$ for the equivalence class of a metric measure
space $(X, r, \mu)$. Define the set of (equivalence classes of) metric
measure spaces

(2.2) $\mathbb{M} := \{\chi = \overline{(X, r, \mu)} : (X, r, \mu) \text{ metric measure space}\}$.

If $(X, r, \mu)$ is such that $r$ is only a pseudo-metric on $X$ (i.e.,
$r(x, y) = 0$ is possible for $x \neq y$), we can still define its
measure-preserving isometry class. Since this class contains also metric
measure spaces, there is a bijection between the set of pseudo-metric
measure spaces and the set of metric measure spaces and we will use both
notions interchangeably.

We are typically only interested in functions of metric measure spaces 
that do not describe artifacts of the chosen representation, i.e., which are 
invariant under measure-preserving isometries. These are of a special form 
which we introduce next. For a metric space (X, r) we define by 

(2.3) $R^{(X,r)} : X^{\mathbb{N}} \to \mathbb{R}_+^{\binom{\mathbb{N}}{2}}, \qquad (x_i)_{i \ge 1} \mapsto (r(x_i, x_j))_{1 \le i < j}$

the map which sends a sequence of points in $X$ to its (infinite)
distance matrix, and denote, for a metric measure space $(X, r, \mu)$,
the distance matrix distribution of $(X, r, \mu)$ by

(2.4) $\nu^{(X,r,\mu)} := (R^{(X,r)})_* \mu^{\otimes\mathbb{N}} \in \mathcal{M}_1(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$.

Obviously, $\nu^{(X,r,\mu)}$ depends on $(X, r, \mu)$ only through its
measure-preserving isometry class $\chi = \overline{(X, r, \mu)}$. We can
therefore define:

Definition 2.1 (Distance matrix distribution). The distance matrix
distribution $\nu^{\chi}$ of $\chi \in \mathbb{M}$ is the distance matrix
distribution $\nu^{(X,r,\mu)}$ of an arbitrary representative
$(X, r, \mu) \in \chi$.
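
For a finite metric measure space the distance matrix distribution can be sampled directly: draw points independently from $\mu$ and read off their mutual distances. The following sketch does this for the first $n$ sampled points. The function name `sample_distance_matrix` and the encoding of the space as a distance table `dist` plus a weight list `weights` are assumptions made for illustration.

```python
import random


def sample_distance_matrix(dist, weights, n, rng):
    """Sample the n x n corner of the distance matrix of a finite space.

    dist[a][b] is the distance between points a and b, and weights[a] is
    the mass of point a under the sampling measure (hypothetical encoding).
    """
    # Draw n points independently according to the sampling measure.
    idx = rng.choices(range(len(weights)), weights=weights, k=n)
    # Record all pairwise distances between the sampled points.
    return [[dist[a][b] for b in idx] for a in idx]
```

Only the entries above the diagonal carry information, since the matrix is symmetric with zero diagonal.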



By Gromov's reconstruction theorem metric measure spaces are uniquely
determined by their distance matrix distribution (see Section 3½.5 in
[Gro99] and Proposition 2.6 in [GPW09a]). We therefore base our notion of
convergence in $\mathbb{M}$ on the convergence of distance matrix
distributions. In [GPW09a] we introduced the Gromov-weak topology. In
this topology a sequence $(\chi_n)_{n \in \mathbb{N}}$ in $\mathbb{M}$
converges to $\chi \in \mathbb{M}$ if and only if

(2.5) $\nu^{\chi_n} \Rightarrow \nu^{\chi}$ as $n \to \infty$

in the weak topology on
$\mathcal{M}_1(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$ (and, as usual,
$\mathbb{R}_+^{\binom{\mathbb{N}}{2}}$ equipped with the product
topology); compare Theorem 5 of [GPW09a]. Although
$\{\nu^{\chi} : \chi \in \mathbb{M}\} \subset \mathcal{M}_1(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$
is not closed, we could show in Theorem 1 of [GPW09a] that $\mathbb{M}$,
equipped with the Gromov-weak topology, is Polish.

Several sub-spaces of $\mathbb{M}$ will be of special interest throughout
the paper. Above all, these are the ultra-metric and compact metric
measure spaces.

(The equivalence class of) a metric measure space $(X, r, \mu)$ is called
ultra-metric iff

(2.6) $r(u, w) \le r(u, v) \vee r(v, w)$,

for $\mu$-almost all $u, v, w \in X$. Define

(2.7) $\mathbb{U} := \{u \in \mathbb{M} : u \text{ is ultra-metric}\}$.
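
For a finite space, condition (2.6) can be checked by brute force over all triples of points that carry mass. The sketch below assumes the same hypothetical encoding as before (a distance table `dist` and a weight list `weights`); the name `is_ultrametric` and the tolerance `tol` are illustrative choices.

```python
from itertools import permutations


def is_ultrametric(dist, weights, tol=1e-12):
    """Check the three-point condition (2.6) on the support of the measure."""
    # For a finite space, "almost all" means "all points of positive mass".
    support = [a for a, w in enumerate(weights) if w > 0]
    return all(
        dist[u][w] <= max(dist[u][v], dist[v][w]) + tol
        for u, v, w in permutations(support, 3)
    )
```

Points of zero mass are ignored, which mirrors the "almost all" in the definition: a space can fail the inequality on a null set and still belong to $\mathbb{U}$.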

Remark 2.2 (Ultra-metric spaces are trees). Notice that there is a close
connection between ultra-metric spaces and $\mathbb{R}$-trees, i.e.,
complete path-connected metric spaces $(X, r_X)$ which satisfy the
four-point condition

(2.8) $r_X(x_1, x_2) + r_X(x_3, x_4) \le \max\{r_X(x_1, x_3) + r_X(x_2, x_4),\ r_X(x_1, x_4) + r_X(x_2, x_3)\}$,

for all $x_1, x_2, x_3, x_4 \in X$ (see, for example, [Dre84, DMT96,
Ter97]). On the one hand, every complete ultra-metric space $(U, r_U)$
spans a path-connected complete metric space $(X, r_X)$ which satisfies
the four-point condition, such that $(U, r_U)$ is isometric to the set of
leaves $X \setminus X^o$. On the other hand, given an $\mathbb{R}$-tree
$(X, r_X)$ and a distinguished point $\rho \in X$ which is often referred
to as the root of $(X, r_X)$, the level sets
$X^t := \{x \in X : r(\rho, x) = t\}$, for some $t \ge 0$, form
ultra-metric sub-spaces of $(X, r_X)$.

Because of this connection between ultra-metric spaces and real trees,
ultra-metric spaces are often (especially in phylogenetic analysis)
referred to as ultra-metric trees. □

The next lemma implies that U equipped with the Gromov-weak topology 
is again Polish. 

Lemma 2.3. The sub-space $\mathbb{U} \subset \mathbb{M}$ is closed.



Proof. Let $(u_n)_{n \in \mathbb{N}}$ be a sequence in $\mathbb{U}$ and
$\chi \in \mathbb{M}$ such that $u_n \to \chi$ in the Gromov-weak
topology, as $n \to \infty$. Equivalently, by (2.5),
$\nu^{u_n} \Rightarrow \nu^{\chi}$ in the weak topology on
$\mathcal{M}_1(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$, as $n \to \infty$.
Consider the open set

(2.9) $A := \{(r_{i,j})_{1 \le i < j} : r_{1,2} > r_{2,3} \vee r_{1,3} \text{ or } r_{2,3} > r_{1,2} \vee r_{1,3} \text{ or } r_{1,3} > r_{1,2} \vee r_{2,3}\}$.

By the Portmanteau Theorem,
$\nu^{\chi}(A) \le \liminf_{n \to \infty} \nu^{u_n}(A) = 0$. Thus, (2.6)
holds for $\mu^{\otimes 3}$-almost all triples $(u, v, w) \in X^3$. In
other words, $\chi$ is ultra-metric. □

(The equivalence class of) a metric measure space $(X, r, \mu)$ is called
compact if and only if the metric space $(\mathrm{supp}(\mu), r)$ is
compact. Define

(2.10) $\mathbb{M}_c := \{\chi \in \mathbb{M} : \chi \text{ is compact}\}$.

Moreover, we set

(2.11) $\mathbb{U}_c := \mathbb{U} \cap \mathbb{M}_c$.

Remark 2.4 ($\mathbb{M}_c$ is not closed).

(i) If $\chi = \overline{(X, r, \mu)}$ is a finite metric measure space,
i.e., $\#\mathrm{supp}(\mu) < \infty$, then $\chi \in \mathbb{M}_c$.

(ii) Since elements of $\mathbb{M}$ can be approximated by a sequence of
finite metric measure spaces (see the proof of Proposition 5.3 in
[GPW09a]), the sub-space $\mathbb{M}_c$ is not closed. A similar argument
shows that $\mathbb{U}_c$ is not closed.

(iii) In order to establish convergence within the space of compact
metric measure spaces, we will provide a pre-compactness criterion in
$\mathbb{M}_c$ in Proposition 6.1. □

2.2. The martingale problem (Theorem 1). In this subsection we define
the tree-valued Fleming-Viot dynamics as the solution of a well-posed
martingale problem. All proofs are given in Section 8. We start by
recalling the terminology.

Definition 2.5 (Martingale problem). Let $(E, \mathcal{O})$ be a
metrizable space, $\mathbf{P}_0 \in \mathcal{M}_1(E)$, $\mathcal{F}$ a
subspace of the space $\mathcal{B}(E)$ of bounded measurable functions on
$E$ and $\Omega$ a linear operator on $\mathcal{B}(E)$ with domain
$\mathcal{F}$.

The law $\mathbf{P}$ of an $E$-valued stochastic process
$X = (X_t)_{t \ge 0}$ is called a solution of the
$(\mathbf{P}_0, \Omega, \mathcal{F})$-martingale problem if $X_0$ has
distribution $\mathbf{P}_0$, $X$ has paths in the space
$\mathcal{D}_E([0, \infty))$ of $E$-valued cadlag functions from
$[0, \infty)$ equipped with the Skorohod topology, almost surely, and for
all $F \in \mathcal{F}$,

(2.12) $\Big(F(X_t) - \int_0^t ds\, \Omega F(X_s)\Big)_{t \ge 0}$

is a $\mathbf{P}$-martingale with respect to the canonical filtration.

Moreover, the $(\mathbf{P}_0, \Omega, \mathcal{F})$-martingale problem is
said to be well-posed if there is a unique solution $\mathbf{P}$.



8 GREVEN, PFAFFELHUBER, WINTER 

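Recall that the classical measure-valued Fleming-Viot process
$\zeta = (\zeta_t)_{t \ge 0}$ is a probability measure-valued diffusion
process, which describes the evolution of allelic frequencies. In
particular, for a fixed time $t$, the state
$\zeta_t \in \mathcal{M}_1(K)$ records the current distribution of
allelic types on some (Polish) type space $K$. This process is defined as
the unique solution of the martingale problem corresponding to the
following second order differential operator: for sufficiently smooth
$F : \mathcal{M}_1(K) \to \mathbb{R}$,

(2.13) $\widehat{\Omega} F(\mu) = \frac{\gamma}{2} \int_K \int_K \big(\mu(du)\delta_u(dv) - \mu(du)\mu(dv)\big)\, \frac{\partial^2 F(\mu)}{\partial\mu\,\partial\mu}[\delta_u, \delta_v]$,

where for $\mu' \in \mathcal{M}_f(K)$ and
$G : \mathcal{M}_f(K) \to \mathbb{R}$

(2.14) $\frac{\partial G(\mu)}{\partial\mu}[\mu'] := \lim_{\varepsilon \to 0} \frac{1}{\varepsilon}\big(G(\mu + \varepsilon\mu') - G(\mu)\big)$

and
$\frac{\partial^2 G(\mu)}{\partial\mu\,\partial\mu}[\mu', \mu''] := \frac{\partial}{\partial\mu}\Big(\frac{\partial G(\mu)}{\partial\mu}[\mu']\Big)[\mu'']$,
for $\mu', \mu'' \in \mathcal{M}_f(K)$, whenever these limits exist (see,
for example, [Daw93, p. 13]).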

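More convenient for us is the following equivalent martingale problem
(see [EK93]) which corresponds to the operator $\widehat{\Omega}$ defined
on functions $\widehat{\Phi} : \mathcal{M}_1(K) \to \mathbb{R}$ of the
form

(2.15) $\widehat{\Phi}(\zeta) = \langle \zeta^{\otimes\mathbb{N}}, \hat\phi \rangle := \int_{K^{\mathbb{N}}} \zeta^{\otimes\mathbb{N}}(d\underline{u})\, \hat\phi(\underline{u})$

with $\underline{u} = (u_1, u_2, \ldots) \in K^{\mathbb{N}}$ and
$\hat\phi \in \mathcal{C}_b(K^{\mathbb{N}})$ depending only on finitely
many coordinates. On such functions (2.13) reads

(2.16) $\widehat{\Omega}\widehat{\Phi}(\zeta) = \frac{\gamma}{2} \sum_{k, l \ge 1} \big(\langle \zeta^{\otimes\mathbb{N}}, \hat\phi \circ \hat\theta_{k,l} \rangle - \langle \zeta^{\otimes\mathbb{N}}, \hat\phi \rangle\big)$,

where the replacement operator $\hat\theta_{k,l}$ is the map which
replaces the $l$th component of an infinite sequence by the $k$th:

(2.17) $\hat\theta_{k,l}(u_1, u_2, \ldots, u_{l-1}, u_l, u_{l+1}, \ldots) := (u_1, u_2, \ldots, u_{l-1}, u_k, u_{l+1}, \ldots)$.

Here and in the following $\gamma \in (0, \infty)$ is referred to as the
resampling rate.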

In order to state the martingale problem for the tree-valued Fleming-Viot
process we need the notion of polynomials on $\mathbb{M}$.

Definition 2.6 (Polynomials). A function
$\Phi : \mathbb{M} \to \mathbb{R}$ is called a polynomial if there exists
a bounded, measurable test function
$\phi : \mathbb{R}_+^{\binom{\mathbb{N}}{2}} \to \mathbb{R}$, depending
only on finitely many variables such that

(2.18) $\Phi(\chi) = \langle \nu^{\chi}, \phi \rangle := \int_{\mathbb{R}_+^{\binom{\mathbb{N}}{2}}} \nu^{\chi}(d\underline{r})\, \phi(\underline{r})$.

Remark 2.7 (Extension and degree of a polynomial). Let $\Phi$ and $\phi$
be as in Definition 2.6.

1. We extend the concept of polynomials to the case of positive, not
necessarily bounded $\phi$. In this case, the right hand side of (2.18)
can be infinite.

2. If $n \in \mathbb{N}$ is the minimal number such that there exists
$\phi \in \mathcal{B}(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$, depending
only on $(r_{i,j})_{1 \le i < j \le n}$ such that (2.18) holds, $n$ is
called the degree of $\Phi$ and $\phi$ is a minimal test function. In
this case we write $\Phi^{n,\phi} := \Phi$.

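Remark 2.8 (Properties of polynomials). Let $\Phi = \Phi^{n,\phi}$ be as
above, for some $n \in \mathbb{N}$ and
$\phi : \mathbb{R}_+^{\binom{\mathbb{N}}{2}} \to \mathbb{R}$.

(i) If $\chi = \overline{(X, r, \mu)}$, then

(2.19) $\Phi(\chi) = \int_{X^{\mathbb{N}}} \mu^{\otimes\mathbb{N}}(d(x_1, x_2, \ldots))\, \phi\big((r(x_i, x_j))_{1 \le i < j}\big)$,

where $\mu^{\otimes\mathbb{N}}$ is the $\mathbb{N}$-fold product measure
of $\mu$.

(ii) Let $\Sigma_n$ be the set of all permutations of $\mathbb{N}$ which
leave $n+1, n+2, \ldots$ fixed and
$\Sigma_\infty = \bigcup_n \Sigma_n$. Given $\sigma \in \Sigma_\infty$,
define

(2.20) $\tilde\sigma\big((r_{i,j})_{1 \le i < j}\big) := \big(r_{\sigma(i) \wedge \sigma(j),\, \sigma(i) \vee \sigma(j)}\big)_{1 \le i < j}$.

The symmetrization of $\phi$ is given by

(2.21) $\bar\phi := \frac{1}{n!} \sum_{\sigma \in \Sigma_n} \phi \circ \tilde\sigma$.

By symmetry of $\nu^{\chi}$,
$\langle \nu^{\chi}, \phi \rangle = \langle \nu^{\chi}, \bar\phi \rangle$
or equivalently, $\Phi^{n,\phi} = \Phi^{n,\bar\phi}$.

□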
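
On a finite metric measure space the integral (2.19) reduces to a finite sum over all $n$-tuples of points, so a polynomial of degree $n$ can be evaluated exactly. A minimal sketch, assuming the hypothetical encoding by a distance table `dist` and weights `weights`, and a test function `phi` that receives the distances as a dictionary keyed by index pairs `(i, j)` with `i < j`:

```python
from itertools import product


def polynomial(dist, weights, n, phi):
    """Evaluate Phi^{n,phi} exactly on a finite metric measure space.

    phi is called with a dict {(i, j): r(x_i, x_j)} for 1 <= i < j <= n.
    """
    total = 0.0
    for sample in product(range(len(weights)), repeat=n):
        # Probability of drawing exactly this n-tuple under the product measure.
        prob = 1.0
        for x in sample:
            prob *= weights[x]
        r = {
            (i + 1, j + 1): dist[sample[i]][sample[j]]
            for i in range(n)
            for j in range(i + 1, n)
        }
        total += prob * phi(r)
    return total
```

For example, with `n = 2` and `phi = lambda r: r[(1, 2)]` this returns the mean distance between two independently sampled points, an unbounded test function in the sense of Remark 2.7. The cost grows like the number of points to the power `n`, so this is only practical for small spaces and low degree.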

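Recall from Subsection 2.1 the space
$\mathcal{B}(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$ of bounded measurable
real-valued functions on $\mathbb{R}_+^{\binom{\mathbb{N}}{2}}$. An
element $\phi \in \mathcal{B}(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$ is
said to be differentiable if for all $1 \le i < j$ the partial
derivatives $\frac{\partial\phi}{\partial r_{i,j}}$ exist and if
$\sum_{1 \le i < j} \big|\frac{\partial\phi}{\partial r_{i,j}}\big| < \infty$.

In this case we put

(2.22) $\mathrm{div}(\phi) := \sum_{1 \le i \ne j} \frac{\partial\phi}{\partial r_{i,j}} = 2 \sum_{1 \le i < j} \frac{\partial\phi}{\partial r_{i,j}}$.

Denote by $\mathcal{C}_b^1(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$ the
space of all bounded and continuously differentiable real-valued
functions $\phi$ on $\mathbb{R}_+^{\binom{\mathbb{N}}{2}}$ with bounded
derivatives. The function spaces we will use in the sequel are the space

(2.23) $\Pi := \{\Phi^{n,\phi} : n \in \mathbb{N},\ \phi \in \mathcal{B}(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})\}$,

and its sub-spaces

(2.24) $\Pi^0 := \{\Phi^{n,\phi} : n \in \mathbb{N},\ \phi \in \mathcal{C}_b(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})\}$,

and

(2.25) $\Pi^1 := \{\Phi^{n,\phi} : n \in \mathbb{N},\ \phi \in \mathcal{C}_b^1(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})\}$.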

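Remark 2.9 (Polynomials form an algebra). Observe that $\Pi$, $\Pi^0$ and
$\Pi^1$ are algebras of functions. Specifically, given $\Phi^{n,\phi}$
and $\Phi^{m,\psi} \in \Pi$,

(2.26) $\Phi^{n,\phi} \cdot \Phi^{m,\psi} = \Phi^{n+m,(\phi,\psi)_n} = \Phi^{n+m,(\psi,\phi)_m}$,

where for
$\phi, \psi \in \mathcal{B}(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$ and
$\ell \in \mathbb{N}$,

(2.27) $(\phi, \psi)_\ell(\underline{r}) := \phi(\underline{r}) \cdot \psi(\tau_\ell \underline{r})$,

with
$\tau_\ell\big((r_{i,j})_{1 \le i < j}\big) = (r_{\ell+i,\ell+j})_{1 \le i < j}$.

By Proposition 2.6 in [GPW09a], $\Pi$ and $\Pi^0$ separate points in
$\mathbb{M}$. Observing that
$\mathcal{C}_b^1(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$ is dense in
$\mathcal{C}_b(\mathbb{R}_+^{\binom{\mathbb{N}}{2}})$, we find that
$\Pi^1$ separates points as well. □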



Example 2.10. Let $n := 2$ and
$\phi(\underline{r}) := \varphi(r_{1,2})$ for
$\varphi \in \mathcal{C}_b^1(\mathbb{R}_+)$. Then

(2.28) $(\phi, \phi)_2(\underline{r}) = \varphi(r_{1,2}) \cdot \varphi(r_{3,4})$.

□

Remark 2.11 (Symmetric polynomials and the Gromov-weak topology). Let
$\chi, \chi_1, \chi_2, \ldots$ in $\mathbb{M}$ and recall from
Proposition 2.6 in [GPW09a] that the functions $\Phi \in \Pi$ separate
points in $\mathbb{M}$. Moreover, $\chi_n \to \chi$ in the Gromov-weak
topology iff $\Phi(\chi_n) \to \Phi(\chi)$ for all $\Phi \in \Pi^0$, as
$n \to \infty$ (see Theorem 5 in [GPW09a]). Notice that the same is true
if we restrict to $\Phi \in \Pi^1$ or to bounded and continuous test
functions $\phi$ which are invariant under finite permutations (compare
Remark 2.8). □



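To lift the measure-valued Fleming-Viot process to the level of trees and
thereby construct the tree-valued Fleming-Viot dynamics, we consider the
martingale problem associated with the operator $\Omega^{\uparrow}$ on
$\Pi$ with domain $\mathcal{D}(\Omega^{\uparrow}) = \Pi^1$. To define
$\Omega^{\uparrow}$ we let for $\Phi = \Phi^{n,\phi} \in \Pi^1$,

(2.29) $\Omega^{\uparrow}\Phi := \Omega^{\uparrow,\mathrm{grow}}\Phi + \Omega^{\uparrow,\mathrm{res}}\Phi$.

The growth operator $\Omega^{\uparrow,\mathrm{grow}}$ reflects the fact
that the population gets older and therefore the genealogical distances
grow at speed 2 as time goes on. We therefore put

(2.30) $\Omega^{\uparrow,\mathrm{grow}}\Phi(u) := \langle \nu^{u}, \mathrm{div}(\phi) \rangle$.

For the resampling operator let

(2.31) $\Omega^{\uparrow,\mathrm{res}}\Phi(u) := \frac{\gamma}{2} \sum_{1 \le k, l \le n} \big(\langle \nu^{u}, \phi \circ \theta_{k,l} \rangle - \langle \nu^{u}, \phi \rangle\big)$,

where we put $\theta_{k,k} := \mathrm{id}$ for all $k \ge 1$, and

(2.32) $\big(\theta_{k,l}((r_{i',j'})_{1 \le i' < j'})\big)_{i,j} := \begin{cases} r_{i,j}, & \text{if } i, j \ne l, \\ r_{i \wedge k,\, i \vee k}, & \text{if } j = l, \\ r_{j \wedge k,\, j \vee k}, & \text{if } i = l. \end{cases}$

Note that $\Omega^{\uparrow}\Phi \in \Pi$ for all $\Phi \in \Pi^1$.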
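
The action of the replacement operator on distance matrices in (2.32) can be made concrete for the first $n$ indices. The sketch below is illustrative only: the name `theta`, the dictionary encoding of $(r_{i,j})_{1 \le i < j \le n}$, and the convention $r_{k,k} := 0$ (needed when the replaced index meets its own parent) are assumptions made here.

```python
def theta(k, l, r, n):
    """Apply theta_{k,l} to distances r = {(i, j): r_ij} with 1 <= i < j <= n.

    Every occurrence of index l is replaced by index k; the distance of an
    index to itself is taken to be 0 (assumed convention).
    """
    def get(i, j):
        return 0.0 if i == j else r[(min(i, j), max(i, j))]

    if k == l:
        return dict(r)  # theta_{k,k} is the identity
    out = {}
    for i in range(1, n + 1):
        for j in range(i + 1, n + 1):
            a = k if i == l else i
            b = k if j == l else j
            out[(i, j)] = get(a, b)
    return out
```

With `k = 1`, `l = 3` and `n = 6`, the third row and column of the matrix are overwritten by the first, and the entry at position (1, 3) becomes 0.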

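Example 2.12. If $k := 1$ and $l := 3$, then

(2.33) $\theta_{1,3}\big((r_{i,j})_{1 \le i < j}\big) = \begin{pmatrix} r_{1,2} & 0 & r_{1,4} & r_{1,5} & r_{1,6} & \cdots \\ & r_{1,2} & r_{2,4} & r_{2,5} & r_{2,6} & \cdots \\ & & r_{1,4} & r_{1,5} & r_{1,6} & \cdots \\ & & & r_{4,5} & r_{4,6} & \cdots \\ & & & & r_{5,6} & \cdots \\ & & & & & \ddots \end{pmatrix}$.

□

Example 2.13. Let $n := 2$ and
$\phi(\underline{r}) = \varphi(r_{1,2})$ be as in Example 2.10,
$p_2 : \underline{r} \mapsto r_{1,2}$ and
$\nu_2^{u} := (p_2)_* \nu^{u}$. Then

(2.34) $\Omega^{\uparrow}\langle \nu^{\cdot}, \phi \rangle(u) = 2 \langle \nu_2^{u}, \varphi' \rangle + \gamma \big(\varphi(0) - \langle \nu_2^{u}, \varphi \rangle\big)$.

□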
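
Formula (2.34) is easy to evaluate on a finite space, where $\nu_2^{u}$ is the law of the distance between two independently sampled points. The sketch below is an illustration under assumptions: the function name `generator_two_point`, the encoding by `dist` and `weights`, and passing the derivative `dphi` explicitly are choices made here.

```python
import math


def generator_two_point(dist, weights, phi, dphi, gamma):
    """Evaluate the right hand side of (2.34) on a finite space.

    phi and dphi are a function of one distance and its derivative.
    """
    mean_phi = 0.0
    mean_dphi = 0.0
    for a, wa in enumerate(weights):
        for b, wb in enumerate(weights):
            mean_phi += wa * wb * phi(dist[a][b])
            mean_dphi += wa * wb * dphi(dist[a][b])
    # Growth at speed 2 plus resampling, which sets the pair distance to 0.
    return 2.0 * mean_dphi + gamma * (phi(0.0) - mean_phi)
```

The factor $\gamma$ (rather than $\gamma/2$) arises because the two ordered pairs $(k, l) = (1, 2)$ and $(2, 1)$ in (2.31) contribute equally, while $k = l$ contributes nothing.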

Our first main result states that the martingale problem associated with
$(\Omega^{\uparrow}, \Pi^1)$ is well-posed.

Theorem 1 (Well-posed martingale problem). Fix
$\mathbf{P}_0 \in \mathcal{M}_1(\mathbb{U})$. The
$(\mathbf{P}_0, \Omega^{\uparrow}, \Pi^1)$-martingale problem has a
unique solution.

This leads to the following definition.

Definition 2.14 (The tree-valued Fleming-Viot dynamics). Fix
$\mathbf{P}_0 \in \mathcal{M}_1(\mathbb{U})$. The tree-valued
Fleming-Viot dynamics with initial distribution $\mathbf{P}_0$ is a
stochastic process with distribution $\mathbf{P}$, the unique solution of
the $(\mathbf{P}_0, \Omega^{\uparrow}, \Pi^1)$-martingale problem.

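Proposition 2.15 (Sample path properties). The tree-valued Fleming-Viot
dynamics $\mathcal{U}$ has the following properties.

(i) $\mathcal{U}$ has sample paths in
$\mathcal{C}_{\mathbb{U}}([0, \infty))$, $\mathbf{P}$-almost surely.

(ii) $\mathcal{U}_t \in \mathbb{U}_c$, for all $t > 0$,
$\mathbf{P}$-almost surely.

(iii) For all $t > 0$,
$\nu^{\mathcal{U}_t}\big((0, \infty)^{\binom{\mathbb{N}}{2}}\big) = 1$,
$\mathbf{P}$-almost surely. Moreover, the random set of exceptional times

(2.35) $\big\{t \in [0, \infty) : \nu^{\mathcal{U}_t}\big((0, \infty)^{\binom{\mathbb{N}}{2}}\big) < 1\big\}$

is a Lebesgue null-set, $\mathbf{P}$-almost surely. In particular, if
$\mathcal{U}_t = \overline{(U_t, r_t, \mu_t)}$, for all $t \ge 0$, then
$\mu_t$ is non-atomic for Lebesgue almost all $t \in \mathbb{R}_+$.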

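Proposition 2.16 (Feller property). The tree-valued Fleming-Viot
dynamics $\mathcal{U}$ is a strong Markov process. Moreover, it has the
Feller property, i.e.,
$u \mapsto \mathbf{P}[f(\mathcal{U}_t) \mid \mathcal{U}_0 = u]$ is
continuous if $f \in \mathcal{C}_b(\mathbb{U})$.

Remark 2.17 (Extended martingale problem). Notice that for all
$\Phi \in \Pi^1$, $(\Phi(\mathcal{U}_t))_{t \ge 0}$ is an
$\mathbb{R}$-valued continuous semi-martingale. Hence by Itô's formula
for continuous semi-martingales (compare, for example, Theorem 32.8 in
[RW00]) this implies that the operator $(\Omega^{\uparrow}, \Pi^1)$
extends to an operator on the algebra

(2.36) $\mathcal{F} = \{f \circ \Phi : f \in \mathcal{B}(\mathbb{R}),\ \Phi \in \Pi\}$

with domain

(2.37) $\mathcal{F}^{2,1} := \{f \circ \Phi : f \in \mathcal{C}_b^2(\mathbb{R}),\ \Phi \in \Pi^1\}$

for $\Phi = \langle \nu^{\cdot}, \phi \rangle$ as follows:

(2.38) $\Omega^{\uparrow}(f \circ \Phi)(u) = f'(\langle \nu^{u}, \phi \rangle) \cdot \Omega^{\uparrow}\langle \nu^{\cdot}, \phi \rangle(u) + \tfrac{1}{2} f''(\langle \nu^{u}, \phi \rangle) \cdot \big(\Omega^{\uparrow}\langle \nu^{\cdot}, \phi \rangle^2(u) - 2 \langle \nu^{u}, \phi \rangle\, \Omega^{\uparrow}\langle \nu^{\cdot}, \phi \rangle(u)\big)$.

For $F = f \circ \Phi$ with $\Phi = \Phi^{n,\phi} \in \Pi^1$, we
therefore put

(2.39) $\Omega^{\uparrow,\mathrm{grow}}F(u) := f'(\langle \nu^{u}, \phi \rangle) \cdot \langle \nu^{u}, \mathrm{div}(\phi) \rangle$

and

(2.40) $\Omega^{\uparrow,\mathrm{res}}F(u) := \frac{\gamma}{2} f'(\langle \nu^{u}, \phi \rangle) \cdot \sum_{1 \le k, l \le n} \langle \nu^{u}, \phi \circ \theta_{k,l} - \phi \rangle + \frac{\gamma}{2}\, n^2 f''(\langle \nu^{u}, \phi \rangle) \cdot \langle \nu^{u}, (\bar\phi, \bar\phi)_n \circ \theta_{1,n+1} - (\bar\phi, \bar\phi)_n \rangle$,

where $\bar\phi$ is the symmetrization of $\phi$ as defined in (2.21). □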



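Corollary 2.18 (Quadratic variation). Let
$\mathcal{U} = (\mathcal{U}_t)_{t \ge 0}$ be the tree-valued Fleming-Viot
dynamics with initial distribution
$\mathbf{P}_0 \in \mathcal{M}_1(\mathbb{U})$ and
$\Phi = \Phi^{n,\phi} \in \Pi^1$. Then
$(\Phi(\mathcal{U}_t))_{t \ge 0}$ is a continuous
$\mathbf{P}$-semi-martingale with quadratic variation

(2.41) $\langle \Phi(\mathcal{U}) \rangle_t = \gamma n^2 \int_0^t ds\, \langle \nu^{\mathcal{U}_s}, (\bar\phi, \bar\phi)_n \circ \theta_{1,n+1} - (\bar\phi, \bar\phi)_n \rangle$.

Remark 2.19 (Quadratic variation for a representative). Assume that for
all $t \ge 0$, $\mathcal{U}_t = \overline{(U_t, r_t, \mu_t)}$. Then the
quadratic variation of $(\Phi(\mathcal{U}_t))_{t \ge 0}$ can be expressed
as

(2.42) $\langle \Phi(\mathcal{U}) \rangle_t = \gamma n^2 \int_0^t ds\, \big\langle \mu_s, (\rho_s - \langle \mu_s, \rho_s \rangle)^2 \big\rangle$,

where $\rho_s : U_s \to \mathbb{R}$ is defined as

(2.43) $\rho_s(u_1) := \int \mu_s^{\otimes\mathbb{N}}(d(u_2, u_3, \ldots))\, \bar\phi\big((r_s(u_i, u_j))_{1 \le i < j}\big)$.

□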



Remark 2.20 (Another martingale problem formulation). We say that
$\Phi \in \Pi$ is minimal if $\Phi$ is not of the form
$\Phi = \tilde\Phi \cdot \hat\Phi$ for some
$\tilde\Phi, \hat\Phi \in \Pi$ of degree $> 0$. Using (2.40) and
Corollary 2.18 it is straightforward to give an alternative formulation
for the martingale problem leading to the tree-valued Fleming-Viot
dynamics:

Let $\mathbf{P}_0 \in \mathcal{M}_1(\mathbb{U})$. A $\mathbb{U}$-valued
process $\mathcal{U} = (\mathcal{U}_t)_{t \ge 0}$ is the tree-valued
Fleming-Viot dynamics with initial distribution $\mathbf{P}_0$ iff

(2.44) $\Big(\Phi(\mathcal{U}_t) - \Phi(\mathcal{U}_0) - \int_0^t ds\, \Omega^{\uparrow}\Phi(\mathcal{U}_s)\Big)_{t \ge 0}$

is a martingale for all minimal $\Phi \in \Pi^1$ with quadratic variation
given by (2.41). □



Remark 2.21 (Connection of measure- and tree-valued Fleming-Viot). Let
us connect the tree- and measure-valued Fleming-Viot dynamics. On the one
hand, consider the tree-valued Fleming-Viot dynamics. At time $t > 0$,
decompose $\mathcal{U}_t = \overline{(U_t, r_t, \mu_t)}$ in disjoint
$2t$-balls with respect to $r_t$. Then the collection of balls charged by
$\mu_t$ is a random finite set which can be ordered according to the
weights of $\mu_t$. On the other hand, consider the measure-valued
Fleming-Viot process on $[0, 1]$, started in a measure which is
absolutely continuous with respect to the Lebesgue measure. At time $t$,
this process has finitely many atoms. The size-ordered version of these
atoms has the same distribution as the size-ordered weights of the
$2t$-balls of the tree-valued Fleming-Viot dynamics. □

Remark 2.22 (The process $\mathcal{U}$ as a diffusion). Since
$\mathcal{U}$ has the strong Markov property with continuous paths, one
would like to consider it as a diffusion, and hence express its generator
via second order differential operators similar to (2.13). Due to the
lack of a linear structure, defining differential operators on
$\mathbb{M}$ is not straightforward at all. We therefore explain here
only how to proceed on the level of representatives.

Let $F : \mathbb{U} \to \mathbb{R}$. Then there exists a real valued
function $\tilde F$ acting on metric measure spaces (rather than their
isometry classes) and such that for all
$\overline{(U, r, \mu)} \in \mathbb{U}$ and
$(\tilde U, \tilde r, \tilde\mu) \in \overline{(U, r, \mu)}$,

(2.45) $F\big(\overline{(U, r, \mu)}\big) = \tilde F\big((\tilde U, \tilde r, \tilde\mu)\big)$.

We consider perturbations of the metric $r$ and of the measure $\mu$.
That is, we define

(2.46) $\frac{\partial \tilde F}{\partial r}\big((U, r, \mu)\big) := \lim_{\varepsilon \to 0} \frac{1}{\varepsilon} \Big(\tilde F\big((U, r + \varepsilon, \mu)\big) - \tilde F\big((U, r, \mu)\big)\Big)$,

with $(r + \varepsilon)(u, v) := r(u, v) + \varepsilon 1_{u \ne v}$, for
all $u, v \in U$, and for each $\mu' \in \mathcal{M}_1((U, r))$,

(2.47) $\frac{\partial \tilde F}{\partial \mu}\big((U, r, \mu)\big)[\mu'] := \lim_{\varepsilon \to 0} \frac{1}{\varepsilon} \Big(\tilde F\big((U, r, \mu + \varepsilon\mu')\big) - \tilde F\big((U, r, \mu)\big)\Big)$,

whenever the limits exist and
$\frac{\partial^2 \tilde F}{\partial\mu\,\partial\mu}[\mu', \mu''] := \frac{\partial}{\partial\mu}\Big(\frac{\partial \tilde F}{\partial\mu}[\mu']\Big)[\mu'']$
as usual.

Then it is easy to check that $\mathcal{U}$ is a solution of the
$(\Omega^{\uparrow,\mathrm{diff}}, \mathcal{F}^{2,1})$-martingale problem
with the operator

(2.48) $\Omega^{\uparrow,\mathrm{diff}}F(u) := 2\, \frac{\partial \tilde F}{\partial r}\big((U, r, \mu)\big) + \frac{\gamma}{2} \int_U \int_U \big(\mu(du)\delta_u(dv) - \mu(du)\mu(dv)\big)\, \frac{\partial^2 \tilde F\big((U, r, \mu)\big)}{\partial\mu\,\partial\mu}[\delta_u, \delta_v]$,

provided that $\overline{(U, r, \mu)} = u$ (recall $\mathcal{F}^{2,1}$
from (2.37)).

Now the problem arises whether the generator of our Feller semigroup and
the above differential operator on its domain do not only have a
measure-determining set in common but actually do agree. For that one
would like to establish that the domain of the differential operator is
dense in $\mathcal{C}_b(\mathbb{U})$ and $\mathcal{F}^{2,1}$ is dense in
the space of sufficiently smooth functions. Such questions are usually
resolved with the help of the Stone-Weierstraß theorem which
unfortunately does not apply here since our state space $\mathbb{U}$ is
not locally compact. □




Fig 1. The graphical representation of a Moran model of size N = 5. By resampling 
the genealogical relationships between individuals change. Arrows between lines indicate 
resampling events. The individual at the tip dies and the other one reproduces. At any 
time, genealogical relationships of individuals •, which are currently alive, can be read 
from this graphical representation. 



2.3. Particle approximation (Theorem 2). A classical result in
population genetics gives the approximation of the measure-valued
Fleming-Viot process by a finite size particle model - the so-called
Moran model - in the limit of large population size (see e.g. [Daw93,
Eth01]). In this model ordered pairs of individuals are replaced by new
pairs in a way that the "children" choose a parent - which then becomes
their common ancestor - independently at random from the parent pair. In
this subsection we state that also the tree-valued Fleming-Viot dynamics
can be approximated by tree-valued resampling dynamics which correspond
to the Moran model. To prepare the statement notice that the Moran model
is most conveniently defined via its graphical representation (see
Figure 1), which we formally describe in the sequel.

Fix a population size $N \in \mathbb{N}$, put
$\mathcal{I} := \{1, 2, \ldots, N\}$ and choose a metric $r_0$ on
$\mathcal{I}$.

Let then $\eta := \{\eta^{i,j};\ i, j \in \mathcal{I}\}$ be a realization
of a family of rate $\gamma/2$ Poisson point processes. We say that for
$i, i' \in \mathcal{I}$ and $0 \le s < t < \infty$ there is a path of
descent from $(i, s)$ to $(i', t)$ if there exist $n \in \mathbb{N}$,
$s =: u_0 < u_1 < u_2 < \cdots < u_n := t$ and $j_0 := i$, $j_n := i'$,
$j_1, \ldots, j_{n-1} \in \mathcal{I}$, such that for all
$k \in \{1, \ldots, n\}$,
$\eta^{j_{k-1},j_k}(u_{k-1}, u_k] = \eta^{j_{k-1},j_k}\{u_k\} = 1$ and
$\eta^{m,j_{k-1}}(u_{k-1}, u_k) = 0$ for all $m \in \mathcal{I}$.

Notice that for all $(i, t) \in \mathcal{I} \times \mathbb{R}_+$ and
$0 \le s \le t$ there exists almost surely a unique

(2.49) $A_s(i, t) \in \mathcal{I}$

such that there is a path of descent from $(A_s(i, t), s)$ to $(i, t)$.
In the following we refer to the individual $A_s(i, t)$ as the ancestor
of $(i, t)$ back at time $s$. For $t \ge 0$, define the pseudo-metric
$r_t$ on $\mathcal{I}$ by

(2.50) $r_t(i, j) := \begin{cases} 2\big(t - \sup\{s \in [0, t] : A_s(i, t) = A_s(j, t)\}\big), & \text{if } A_0(i, t) = A_0(j, t), \\ 2t + r_0\big(A_0(i, t), A_0(j, t)\big), & \text{if } A_0(i, t) \ne A_0(j, t), \end{cases}$

for $i, j \in \mathcal{I}$. This induces as usual a metric space by
passing to equivalence classes.

We then call $\mathcal{U}^N = (\mathcal{U}_t^N)_{t \ge 0}$ the
tree-valued Moran dynamics with population size $N$, where for $t \ge 0$
(recall (2.2)),

(2.51) $\mathcal{U}_t^N := \overline{\Big(\mathcal{I},\ r_t,\ \frac{1}{N} \sum_{i \in \mathcal{I}} \delta_i\Big)}$.
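
The pseudo-metric (2.50) can also be tracked forward in time instead of through ancestral paths: between resampling events all distances between distinct individuals grow at speed 2, and when a parent replaces a child, the child inherits the parent's distances and is at distance 0 from it. The sketch below uses this forward description, which is an equivalent reformulation chosen here for simplicity, not the construction in the text; the name `simulate_tree_valued_moran` and the list-of-lists encoding of `r0` are assumptions.

```python
import random


def simulate_tree_valued_moran(r0, gamma, t_end, rng):
    """Return the distance matrix r_t at time t_end, started from r0.

    Assumption: each ordered pair (parent, child) fires at rate gamma / 2,
    as for the Poisson point processes in the graphical representation.
    """
    n = len(r0)
    r = [row[:] for row in r0]
    total_rate = gamma * n * (n - 1) / 2.0

    def grow(dt):
        # Genealogical distances between distinct individuals grow at speed 2.
        for i in range(n):
            for j in range(n):
                if i != j:
                    r[i][j] += 2.0 * dt

    t = 0.0
    while True:
        dt = rng.expovariate(total_rate) if total_rate > 0 else float("inf")
        if t + dt > t_end:
            grow(t_end - t)
            return r
        grow(dt)
        t += dt
        parent, child = rng.sample(range(n), 2)
        # The child copies the parent's distances; r[parent][parent] = 0
        # makes the new parent-child distance equal to 0.
        for m in range(n):
            if m != child:
                r[child][m] = r[m][child] = r[parent][m]
```

Together with the uniform measure on the $N$ individuals, the returned matrix is a representative of the state in (2.51).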



The next result states that the convergence of Moran to Fleming-Viot
dynamics also holds on the level of trees.

Theorem 2 (Convergence of tree-valued Moran to Fleming-Viot dynamics).
For $N \in \mathbb{N}$, let $\mathcal{U}^N$ be the tree-valued Moran
dynamics with population size $N$, and let
$\mathcal{U} = (\mathcal{U}_t)_{t \ge 0}$ be the tree-valued Fleming-Viot
dynamics. If $\mathcal{U}_0^N \Rightarrow \mathcal{U}_0$, as
$N \to \infty$, weakly in the Gromov-weak topology, then

(2.52) $\mathcal{U}^N \underset{N \to \infty}{\Longrightarrow} \mathcal{U}$,

weakly in the Skorohod topology on
$\mathcal{D}_{\mathbb{U}}([0, \infty))$.



Remark 2.23. (i) In empirical population genetics, models for finite
populations rather than infinite populations are of primary interest.
Theorem 2 allows us to give an asymptotic analysis of functionals of the
tree-valued Moran dynamics by studying the tree-valued Fleming-Viot
dynamics, which are simpler to handle analytically.

(ii) The measure-valued Fleming-Viot process is universal in the sense
that it is the limit point of frequency paths of various exchangeable
population models of constant size. (A precise condition, based on a
result from [MS01], is given in (2.57).) We conjecture that the same
universality holds on the level of trees, i.e., the tree-valued
Fleming-Viot dynamics is the point of attraction of various exchangeable
tree-valued dynamics. We will discuss universality as well as different
potential limit points in more detail in Subsection 2.5. □

2.4. Long-term behavior (Theorem 3). Genealogical relationships in
neutral models have been studied frequently since the introduction of the
Kingman coalescent in [Kin82a]. This stochastic process describes the
genealogy of a Moran population in equilibrium and its projective limit
as the population size tends to infinity. In this section we formulate
the related convergence result for the tree-valued resampling dynamics.

Recall that a partition of $\mathbb N$ is a collection $p = \{\pi_1, \pi_2, \ldots\}$ of pairwise
disjoint subsets of $\mathbb N$, also called blocks, such that $\mathbb N = \bigcup_i \pi_i$. The partition $p$
defines an equivalence relation $\sim_p$ on $\mathbb N$ by $i \sim_p j$ if and only if there exists
a partition element $\pi \in p$ with $i,j \in \pi$. We denote by $\mathbb S$ the set of partitions
of $\mathbb N$ and define the restrictions $\rho_1, \rho_2, \ldots$ on $\mathbb S$ by
$\rho_k \circ p := \{\pi_i \cap \{1,\ldots,k\} : \pi_i \in p\}$. We say that a sequence
$p_1, p_2, \ldots \in \mathbb S$ converges to a partition $p$ if the
sequences $\rho_k \circ p_n$ converge in the discrete topology to $\rho_k \circ p$, for all $k \in \mathbb N$.

Starting in $\varkappa_0 = p \in \mathbb S$, the Kingman coalescent is the unique $\mathbb S$-valued
strong Markov process $\varkappa = (\varkappa_t)_{t \ge 0}$ such that any pair of blocks merges at
rate $\gamma$ (see, for example, [Kin82b, Pit99]).

Every realization $k = (k_t)_{t \ge 0}$ of $\varkappa$ gives a pseudo-metric on $\mathbb N$ defined by

(2.53)  $r^k(i,j) := 2 \cdot \inf\{t \ge 0 : i \sim_{k_t} j\}$,

i.e., $r^k(i,j)$ is proportional to the time needed for $i$ and $j$ to coalesce. Note
that $(\mathbb N, r^k)$ is ultra-metric and that $r^k(i,j)$ can be thought of as a genealogical
distance. Denote then by $(L^k, r^k)$ the completion of $(\mathbb N, r^k)$. Clearly,
$(L^k, r^k)$ is also ultra-metric. Define $H_N$ to be the map which takes a
realization of the $\mathbb S$-valued coalescent and maps it to (an equivalence class of) a
pseudo-metric measure space by

(2.54)  $H_N : k \mapsto \overline{\big(L^k, r^k, \mu^k_N := \tfrac1N \sum_{i=1}^N \delta_i\big)}$.

Notice that for each $N$, the map $H_N$ is continuous.

By Theorem 4 in [GPW09a], there exists a $\mathbb U$-valued random variable
$\mathcal U_\infty$ such that

(2.55)  $H_N(\varkappa) \Rightarrow \mathcal U_\infty$, as $N \to \infty$,

weakly in the Gromov-weak topology. The limit object $\mathcal U_\infty$ is called the
Kingman measure tree. Since the Kingman coalescent comes down from infinity
immediately, the Kingman measure tree is compact (see [Eva00]).
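
The pseudo-metric (2.53) can be computed explicitly for a finite coalescent. The sketch below is an illustration under assumptions, not part of the paper: it simulates a Kingman coalescent on $n$ labels (rather than on $\mathbb N$) with pair coalescence rate `gamma`, and the helper names `kingman_distances` and `is_ultrametric` are hypothetical.

```python
import random
from itertools import combinations

def kingman_distances(n, gamma, rng):
    """Simulate a Kingman n-coalescent and return the matrix r(i, j) = 2 * (coalescence time)."""
    blocks = [[i] for i in range(n)]
    r = [[0.0] * n for _ in range(n)]
    t = 0.0
    while len(blocks) > 1:
        b = len(blocks)
        # each of the b * (b - 1) / 2 pairs of blocks merges at rate gamma
        t += rng.expovariate(gamma * b * (b - 1) / 2)
        x, y = rng.sample(range(b), 2)
        for i in blocks[x]:
            for j in blocks[y]:
                r[i][j] = r[j][i] = 2.0 * t
        merged = blocks[x] + blocks[y]
        blocks = [blk for idx, blk in enumerate(blocks) if idx not in (x, y)] + [merged]
    return r

def is_ultrametric(r, tol=1e-12):
    """Check the strong triangle inequality r(i, k) <= max(r(i, j), r(j, k))."""
    n = len(r)
    return all(
        r[i][k] <= max(r[i][j], r[j][k]) + tol
        for i in range(n) for j in range(n) for k in range(n)
    )
```

Calling `is_ultrametric(kingman_distances(6, 1.0, random.Random(1)))` checks numerically the remark that $(\mathbb N, r^k)$ is ultra-metric, restricted to six labels.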

Theorem 3 (Convergence to the Kingman measure tree). Let
$\mathcal U = (\mathcal U_t)_{t \ge 0}$ be the tree-valued Fleming-Viot dynamics starting in
$\mathcal U_0$ and $\mathcal U_\infty$ the Kingman coalescent measure tree. Then

(2.56)  $\mathcal U_t \Rightarrow \mathcal U_\infty$, as $t \to \infty$.

In particular, the distribution of $\mathcal U_\infty$ is the unique equilibrium distribution
of the tree-valued Fleming-Viot dynamics.



Remark 2.24 (Exchange of limits). Recall from (2.51) the tree-valued
Moran dynamics $\{\mathcal U^N = (\mathcal U^N_t)_{t \ge 0};\ N \in \mathbb N\}$. It is straightforward to check
that for all $N \in \mathbb N$ and for all possible initial states,
$\mathcal U^N_t \Rightarrow H_N(\varkappa)$, as $t \to \infty$, and
therefore the limits $N \to \infty$ (see Theorem 2) and $t \to \infty$ (see Theorem 3)
can be exchanged by (2.55). □

2.5. Extensions. Several extensions of our construction of the tree- valued 
resampling dynamics are possible. In particular, we see three different direc- 
tions. 

• A re-introduction of genetic types allows for type-dependent evolution.
In particular, this allows one to construct tree-valued resampling
dynamics under mutation and selection. Alternatively, if individuals are
assigned a type describing the location in some geographical space, we
can model spatial tree-valued resampling dynamics.

• By taking all individuals ever alive into account it is possible to con- 
struct a random genealogy including besides the current population 
all the fossils. 

• Considering more general exchangeable resampling mechanisms than 
in the Moran model above leads to various different tree-valued limit 
dynamics. 

The first two extensions require the introduction of extended state spaces
and thereby a thorough investigation of their topological properties. The
third extension does not, and we next list the conjectures which are
straightforward to formulate in this case.

So far we have considered the tree-valued Moran resampling dynamics
and their diffusion limit. A general framework for resampling dynamics is
provided by the models of Cannings (see [Can74, Can75] and Section 3.3 in
[Ewe04] for a survey). For a fixed population size $N \in \mathbb N$, each Cannings
model is characterized by a family of $\mathbb N$-valued exchangeable random variables
$\{V^{(N)}_1, \ldots, V^{(N)}_N\}$ with $\sum_{i=1}^N V^{(N)}_i = N$ recording the numbers of offspring
of the individuals currently alive. After random waiting times the population
is replaced by a new population where the $i$th individual is replaced by
$V^{(N)}_i$ new individuals, $i = 1, \ldots, N$. We can therefore extend our construction
to more general exchangeable resampling dynamics. This includes, for example,
resampling dynamics which allow for an infinite offspring variance in
the forward and multiple mergers of ancestral lines in the backward picture.
In the following we denote the tree-valued Cannings dynamics of size $N$ by
$\mathcal V^N = (\mathcal V^N_t)_{t \ge 0}$.

Classifying coalescent processes for exchangeable models, [MS01] have
shown that genealogies of the measure-valued Cannings dynamics in discrete
time, suitably rescaled, converge to the Kingman coalescent, which is dual
to the Fleming-Viot process, provided condition (2.57) on the offspring
distributions holds.

We claim that the same result applies to the tree-valued dynamics.



Conjecture 2.25 ("Light tail" Cannings converges to Fleming-Viot).
Assume that the offspring distributions satisfy (2.57). Under a suitable time
rescaling, $(\mathcal V^N)_{N \in \mathbb N}$ converges weakly to the tree-valued Fleming-Viot
dynamics in the Skorohod topology on $\mathcal D_{\mathbb U}([0,\infty))$, as $N \to \infty$.



There are other regimes where the tree-valued Cannings dynamics converge
but the limit is not a diffusion. In the sequel we focus on continuous-time
Cannings models with $V^{(N)}_i > 1$ for at most one $i$. These are most
easily described using a measure $\Lambda \in \mathcal M_f([0,1])$. The $\Lambda$-Cannings
dynamics for population size $N \in \mathbb N$ are given by the following two independent
mechanisms:

• Each (unordered) pair resamples at constant rate $\Lambda\{0\}$.

• For each $x \in (0,1]$, each individual kills, at a rate determined by $\Lambda(dx)$,
a binomial number $\mathrm{Bin}(x,N)$ of individuals (including itself) and simultaneously
replaces the gaps by the same number of copies of itself,

where $\mathrm{Bin}(x,N)$ denotes a random variable with binomial distribution with
parameters $(x,N)$. In the following we denote by $\mathcal V^{N,\Lambda}$ the corresponding
tree-valued Cannings dynamics.

Next, consider the limit of infinite population size for the Cannings model.
The long-term behavior of the genealogy of the Cannings model is given by
$\Lambda$-coalescents, introduced in [Pit99]. It is known that $\Lambda$-coalescents can be
associated with the $\Lambda$-coalescent measure tree, provided the dust-free
property holds, i.e.,

(2.58)  $\int_0^1 \Lambda(dx)\, x^{-1} = \infty$

(compare Lemma 25 in [Pit99] and Theorem 4 in [GPW09a]).

Conjecture 2.26 ($\Lambda$-Cannings converge to $\Lambda$-resampling dynamics).
Assume that the dust-free property (2.58) holds and that there exists a random
element $\mathcal V^\Lambda_0$ with $\mathcal V^{N,\Lambda}_0 \Rightarrow \mathcal V^\Lambda_0$, as $N \to \infty$.

(i) There exists a $\mathbb U$-valued process $\mathcal V^\Lambda \in \mathcal D_{\mathbb U}(\mathbb R_+)$, with
$\mathcal V^{N,\Lambda} \Rightarrow \mathcal V^\Lambda$, as $N \to \infty$.

(ii) The process $\mathcal V^\Lambda$ is Feller and strong Markov and has the $\Lambda$-coalescent
measure tree as its unique equilibrium.

In the following we refer to the process $\mathcal V^\Lambda$ as the tree-valued $\Lambda$-resampling
dynamics.

Note first that $\Lambda(0,1] > 0$ implies that a substantial fraction of the population
is replaced by one individual. Hence we expect continuous paths if
and only if $\Lambda = \gamma\delta_0$, for some $\gamma > 0$.

Conjecture 2.27 (Continuous paths versus jumps). The following are
equivalent.

(i) The process $\mathcal V^\Lambda$ has continuous paths.

(ii) $\Lambda = \gamma\delta_0$, for some $\gamma > 0$.

(iii) The process $\mathcal V^\Lambda$ is the tree-valued Fleming-Viot dynamics.

When an individual replaces a fraction of the population, the measure
has an atom. Nevertheless, tree growth destroys this atom immediately.

Conjecture 2.28 ($\mathcal V^\Lambda$ has no atoms). For Lebesgue almost all $t$, the
metric measure space $\mathcal V^\Lambda_t$ has no atoms.

Moreover, it is known that the $\Lambda$-coalescent comes down from infinity in
infinitesimally small time if and only if Condition (2.59) holds, a summability
condition over $b = 2, 3, \ldots$ whose summands are defined in (2.60)
(see Theorem 1 in [Sch00b]).

Since balls either agree or are disjoint in ultra-metric spaces, the number
of ancestral lines a time $\varepsilon > 0$ back equals the number of $2\varepsilon$-balls one
needs to cover the ultra-metric space representing the coalescent tree. Hence
we expect the following.

Conjecture 2.29 (Compactness and "coming down from infinity"). The
following are equivalent.

(i) For each $t > 0$, $\mathcal V^\Lambda_t \in \mathbb U_c$, almost surely.

(ii) The measure $\Lambda \in \mathcal M_f([0,1])$ satisfies Condition (2.59).
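
The correspondence between ancestral lines and covering balls used to motivate this conjecture can be checked on a finite ultra-metric distance matrix. The sketch below is illustrative only: `count_balls` is a hypothetical helper, it uses closed balls (the choice between open and closed balls is an assumption here), and it relies on the input being ultra-metric so that "distance at most `radius`" is an equivalence relation.

```python
def count_balls(r, radius):
    """Number of closed balls of the given radius needed to cover all points.

    Assumes r is an ultra-metric distance matrix, so a greedy choice of
    centers yields the (unique) partition into balls.
    """
    centers = []
    for i in range(len(r)):
        if all(r[i][c] > radius for c in centers):
            centers.append(i)
    return len(centers)

# Four leaves: 0 and 1 coalesce at time 0.5, 2 and 3 at time 1, everything at time 2,
# so the distances (twice the coalescence times) are 1, 2 and 4.
example = [
    [0.0, 1.0, 4.0, 4.0],
    [1.0, 0.0, 4.0, 4.0],
    [4.0, 4.0, 0.0, 2.0],
    [4.0, 4.0, 2.0, 0.0],
]
```

For this example the number of balls of radius $2\varepsilon$ drops from 4 to 3, 2 and 1 as $\varepsilon$ passes the coalescence times 0.5, 1 and 2, matching the number of ancestral lines at time $\varepsilon$ back.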

We complete this subsection by pointing out that there are more general
resampling dynamics in which $V^{(N)}_i > 1$ for more than one index
$i \in \{1,\ldots,N\}$. In this case the genealogical trees are described by coalescent
processes allowing for simultaneous multiple mergers (see [Sch00a]).

Outline. The rest of the paper is organized as follows. As an application
we study the evolution of subtree length distributions in Section 3.
A duality relation of the tree-valued Fleming-Viot dynamics to the
tree-valued Kingman coalescent is given in Section 4. In Section 5 we give a
formal construction of tree-valued Moran dynamics using well-posed
martingale problems. The Moran models build, as shown in Section 6, a tight
sequence. Duality and tightness provide the tools necessary for the proofs of
Theorems 1 through 3, which are carried out in Section 8. In Section 9 we
give the proofs of the applications of Section 3.

3. Application: Subtree length distribution (Theorems 4 and 5).

In this section we investigate the distribution of the vector containing the
lengths of the subtrees spanned by subsequently sampled points, which is
referred to as the subtree length distribution. All proofs are given in Section 9.
The main result in Subsection 3.1 is that the subtree length distribution
uniquely determines ultra-metric measure spaces. In Subsection 3.2 the
corresponding martingale problem and its well-posedness are stated. In
Subsection 3.3 we study, with the mean sample Laplace transform, a special
functional of the subtree length distribution. This, in particular, allows for
a description of the evolution of the number of segregating sites in a sample
under a neutral infinite sites model.

3.1. The subtree length distribution (Theorem 4). Recall from Remark 2.2
that we can isometrically embed any ultra-metric space $(U, r_U)$ via a function
$\varphi$ into a path-connected space $(X, r_X)$ which satisfies the four-point
condition (2.8) such that $X \setminus X^\circ$ is isometric to $(U, r_U)$. For a sequence
$u_1, \ldots, u_n \in U$ with $n \in \mathbb N$, let

(3.1)  $L_n^{(U,r_U)}(\{u_1,\ldots,u_n\}) := L_n^{(X,r_X)}(\{\varphi(u_1),\ldots,\varphi(u_n)\})$
       := length of the subtree of $(X, r_X)$ spanned by $\{\varphi(u_1),\ldots,\varphi(u_n)\}$,

where for an $\mathbb R$-tree $(X, r_X)$ with finitely many leaves the length of the tree
is defined as the total mass of the one-dimensional Hausdorff measure on
$(X, \mathcal B(X))$.

Note that the length of the tree spanned by a finite sample is a function
of their mutual distances, as we state next.

Lemma 3.1 (Total length of a sub-tree spanned by a finite subset). For
a metric space $(X, r_X)$ satisfying the four-point condition (2.8) and for all
$x_1, \ldots, x_n \in X$,

(3.2)  $L_n^{(X,r_X)}(\{x_1,\ldots,x_n\}) = \frac12 \inf\Big\{ \sum_{i=1}^n r_X(x_i, x_{\sigma(i)});\ \sigma \in \Sigma_n^1 \Big\}$,

where $\Sigma_n^1$ := {permutations of $\{1,\ldots,n\}$ with one cycle}.
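
For small samples, formula (3.2) can be evaluated by brute force over all cyclic tours. The following sketch is an illustration, not an efficient algorithm: the name `subtree_length` and the representation of the metric as a nested list are assumptions, and the running time grows factorially in the sample size.

```python
from itertools import permutations

def subtree_length(r, points):
    """Half the length of the shortest cyclic tour through `points`, as in (3.2).

    `r` is a symmetric distance matrix that is assumed to satisfy the
    four-point condition; `points` is a sequence of distinct indices.
    """
    pts = list(points)
    if len(pts) < 2:
        return 0.0
    first, rest = pts[0], pts[1:]
    best = float("inf")
    # fixing the first point enumerates every one-cycle permutation
    for order in permutations(rest):
        tour = [first, *order]
        total = sum(r[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))
        best = min(best, total)
    return best / 2.0
```

As a check, take four leaves where 0 and 1 are at distance 1, 2 and 3 at distance 2, and all other pairs at distance 4. The tree has branch lengths 0.5 + 0.5 + 1.5 + 1 + 1 + 1 = 5.5, and the shortest tour 0, 1, 2, 3 has length 1 + 4 + 2 + 4 = 11, half of which is 5.5.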

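To specify the distribution of the length of the subtrees of subsequently
sampled points we consider the map

(3.3)  $\ell : \mathbb R_+^{\binom{\mathbb N}{2}} \to \mathbb R_+^{\mathbb N}$,
       $\underline{\underline r} \mapsto \big(0, \ell_2(\underline{\underline r}), \ell_3(\underline{\underline r}), \ldots\big)$,

where for each $n \in \mathbb N$,

(3.4)  $\ell_n(\underline{\underline r}) := \frac12 \inf\Big\{ \sum_{i=1}^n r_{i,\sigma(i)};\ \sigma \in \Sigma_n^1 \Big\}$.

We then define the subtree length distribution of $\mathfrak u \in \mathbb U$ by

(3.5)  $\xi(\mathfrak u) := \ell_* \nu^{\mathfrak u} \in \mathcal M_1(\mathbb R_+^{\mathbb N})$.

The first key result states that the subtree length distribution uniquely
characterizes ultra-metric measure spaces.

Theorem 4 (Uniqueness and continuity of tree lengths distribution).
The map $\xi : \mathbb U \to \mathcal M_1(\mathbb R_+^{\mathbb N})$ from (3.5) is injective and continuous (where
$\mathcal M_1(\mathbb R_+^{\mathbb N})$ is endowed with the weak topology and $\mathbb R_+^{\mathbb N}$ with the product
topology).

Remark 3.2 ($\xi(\mathbb U)$ is complete and separable). We endow the space
$\xi(\mathbb U) \subseteq \mathcal M_1(\mathbb R_+^{\mathbb N})$ with a complete metric induced by $\xi$. For this purpose,
take a complete metric $d_{\mathbb U}(\cdot,\cdot)$ on $\mathbb U$ and set for $\lambda_i \in \xi(\mathbb U)$, $i = 1,2$, using
the injectivity of $\xi$,

(3.6)  $d_{\xi(\mathbb U)}(\lambda_1, \lambda_2) := d_{\mathbb U}\big(\xi^{-1}(\lambda_1), \xi^{-1}(\lambda_2)\big)$.

It follows from completeness and separability of $\mathbb U$ that $\xi(\mathbb U)$ is also complete
and separable, i.e., $\xi(\mathbb U)$ is Polish. □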

Remark 3.3 (Conjecture about general tree spaces). Theorem 4 shows
uniqueness of the tree-length distribution on the space of ultra-metric spaces.
We are not aware of a counter-example showing that injectivity fails to hold
for metric measure spaces satisfying the four-point condition (2.8). □

3.2. Martingale problem for the subtree length distribution (Theorem 5).
We investigate the evolution of the subtree length distribution under the
tree-valued Fleming-Viot dynamics. That is, given the tree-valued Fleming-Viot
dynamics $\mathcal U = (\mathcal U_t)_{t \ge 0}$, we consider

(3.7)  $\Xi = (\Xi_t)_{t \ge 0}$,  $\Xi_t := \xi(\mathcal U_t)$.

To describe the process $\Xi$ via a martingale problem, we define the operator
$\Omega^{\uparrow,\Xi}$ on the algebra $\Pi^\Xi := \{\Phi \circ \xi^{-1} : \Phi \in \Pi\}$ with domain
$\Pi^{1,\Xi} := \{\Phi \circ \xi^{-1} : \Phi \in \Pi^1\}$ by

(3.8)  $\Omega^{\uparrow,\Xi}(\Phi \circ \xi^{-1})(\lambda) := \Omega^\uparrow \Phi(\xi^{-1}(\lambda))$,

for all $\lambda \in \xi(\mathbb U)$.

In $\Pi^{1,\Xi}$ we find, in particular, functions $\Psi^\psi \in \Pi^{1,\Xi}$ which are of the form

(3.9)  $\Psi^\psi(\lambda) = \int \lambda(d\underline\ell)\, \psi(\underline\ell)$,

for a test function $\psi \in \mathcal C_b^1(\mathbb R_+^{\mathbb N})$.

The main result of the section is the following.

Theorem 5 (The subtree lengths distribution process). For $P_0 \in \mathcal M_1(\mathbb U)$,
let $\mathcal U$ be the tree-valued Fleming-Viot dynamics with initial distribution $P_0$.

(i) The $(\xi_* P_0, \Omega^{\uparrow,\Xi}, \Pi^{1,\Xi})$-martingale problem is well-posed in $\xi(\mathbb U)$. Its
unique solution is given by $\Xi = (\Xi_t)_{t \ge 0}$ with $\Xi_t = \xi(\mathcal U_t)$, for $t \ge 0$.
The process $\Xi$ has the Feller property. In addition, it has continuous
sample paths, almost surely.

(ii) The action of $\Omega^{\uparrow,\Xi}$ on a function of the form (3.9) is given by

(3.10)  $\Omega^{\uparrow,\Xi} \Psi^\psi(\lambda) := \sum_{n \ge 2} n \big\langle \lambda, \tfrac{\partial}{\partial \ell_n} \psi \big\rangle + \gamma \sum_{n \ge 1} n \big\langle \lambda, \psi \circ \beta_n - \psi \big\rangle$,

where $\beta_n : \mathbb R_+^{\mathbb N} \to \mathbb R_+^{\mathbb N}$ is given by

(3.11)  $\beta_n : (\ell_1 = 0, \ell_2, \ell_3, \ldots) \mapsto (\ell_1 = 0, \ldots, \ell_{n-1}, \ell_n, \ell_n, \ell_{n+1}, \ldots)$.

3.3. Explicit calculations. We consider in this section the mean sample
Laplace transforms, i.e., functions of the form (3.9) with test functions of
the form

(3.12)  $\psi(\underline\ell) := e^{-\sigma \ell_n}$

for some $n \in \mathbb N$ and $\sigma \in \mathbb R_+$. Using (3.10) we obtain
the following explicit expressions.



Corollary 3.4 (Mean sample Laplace transforms). Let $\Xi = (\Xi_t)_{t \ge 0}$
be the solution of the $(\xi_* P_0, \Omega^{\uparrow,\Xi}, \Pi^{1,\Xi})$-martingale problem. For all $\sigma \in \mathbb R_+$
and $n \ge 2$, set

(3.13)  $g^n(t;\sigma) := \mathbf E\Big[ \int \Xi_t(d\underline\ell)\, e^{-\sigma \ell_n} \Big]$.

Then,

(3.14)  [explicit expression for $g^n(t;\sigma)$ in terms of ratios of Gamma functions and the
        initial values $g^m(0;\sigma)$, $m = 2, \ldots, n$].

In particular, if $g^n(\sigma) = \lim_{t \to \infty} g^n(t;\sigma)$ then

(3.15)  $g^n(\sigma) = \mathbf E\big[ e^{-\sigma \sum_{k=2}^n \mathcal E^k} \big]$,

where $\{\mathcal E^k;\ k = 2, \ldots, n\}$ are independent and $\mathcal E^k$ is exponentially distributed
with mean $\frac{2}{\gamma(k-1)}$, $k = 2, \ldots, n$.



Remark 3.5 (Length of n-Kingman coalescent). Consider the Kingman
coalescent started with $n$ individuals, and let $L_n$ denote the total length of
the corresponding genealogical tree. Note that (3.15) implies the well-known
fact (implicitly stated already in [Wat75]) that

(3.16)  $L_n \overset{d}{=} \sum_{k=2}^n \mathcal E^k$.

Analogous results for more general $\Lambda$-coalescents can be found in [DDSJ08,
DIMR07]. □
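
As a worked consequence of (3.15) and (3.16), and assuming only the independence and the exponential means $2/(\gamma(k-1))$ stated there, the limiting Laplace transform and the mean tree length can be written in closed form:

```latex
\begin{align*}
g^n(\sigma) &= \prod_{k=2}^{n} \mathbf{E}\big[e^{-\sigma \mathcal{E}^k}\big]
             = \prod_{k=2}^{n} \frac{\gamma(k-1)}{\gamma(k-1) + 2\sigma}, \\
\mathbf{E}[L_n] &= \sum_{k=2}^{n} \frac{2}{\gamma(k-1)}
             = \frac{2}{\gamma} \sum_{k=1}^{n-1} \frac{1}{k}.
\end{align*}
```

For example, with $\gamma = 1$ and $n = 3$ the mean tree length is $2(1 + \tfrac12) = 3$.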



Example 3.6 (Tree length and number of segregating sites). In empirical
population genetics genealogical trees of samples are of great interest
(e.g., [Ewe04]). Consider in addition to the Fleming-Viot dynamics
state-independent neutral mutations at rate $\frac\theta2 > 0$. Under the assumptions of
the infinite sites model every mutation on the sampled tree can be seen in
data taken from the sampled individuals, and the total number of observed
polymorphisms (segregating sites) equals the total number of mutations on
the sample tree. Given the length $L_n^{(U,r)}(\{u_1,\ldots,u_n\})$ of the sub-tree spanned
by the sample $\{u_1,\ldots,u_n\}$, the number of segregating sites, denoted by

(3.17)  $S_n(\{u_1,\ldots,u_n\})$ := #{mutations which fall on the sub-tree spanned by $\{u_1,\ldots,u_n\}$},

is Poisson distributed with parameter $\frac\theta2 L_n^{(U,r)}(\{u_1,\ldots,u_n\})$. In particular, if
$\mathcal U = (\mathcal U_t)_{t \ge 0}$, $\mathcal U_t = \overline{(U_t, r_t, \mu_t)}$, is the tree-valued Fleming-Viot dynamics,

(3.18)  $\mathbf E\Big[ \int \mu_t^{\otimes n}(d(u_1,\ldots,u_n))\, e^{-\sigma S_n(\{u_1,\ldots,u_n\})} \Big] = g^n\big(t; \tfrac\theta2 (1 - e^{-\sigma})\big)$,

for $\sigma \in \mathbb R_+$, where $g^n$ is as defined in (3.13) and explicitly calculated in
Corollary 3.4. □

4. Duality. If applicable, duality is an extremely useful technique in
the study of Markov processes. It is well-known that the Kingman coalescent
is dual to the neutral measure-valued Fleming-Viot process (see, for
example, [Daw93, Eth01]). In this section this duality is lifted to the
tree-valued Fleming-Viot dynamics. We will apply the duality to show uniqueness
of the martingale problem for the tree-valued Fleming-Viot process and its
relaxation to the equilibrium Kingman measure tree in Section 8.

The dual process. Recall from Subsection 2.4 the Kingman coalescent
$\varkappa = (\varkappa_t)_{t \ge 0}$ and its state space $\mathbb S$ of partitions of $\mathbb N$. Since we are constructing
a dual to the $\mathbb U$-valued dynamics, we add a component which measures
genealogical distances. The state space of the dual tree-valued Kingman
coalescent therefore is

(4.1)  $\mathbb K := \mathbb S \times \mathbb R_+^{\binom{\mathbb N}{2}}$,

equipped with the product topology. In particular, since $\mathbb S$ and $\mathbb R_+^{\binom{\mathbb N}{2}}$ are
Polish, $\mathbb K$ is Polish as well.

In the following we call the $\mathbb K$-valued stochastic process $\mathcal K = (\mathcal K_t)_{t \ge 0}$, with

(4.2)  $\mathcal K_t = (\varkappa_t, \underline{\underline r}'_t)$,

the tree-valued Kingman coalescent if it follows the dynamics:

Coalescence. $\varkappa = (\varkappa_t)_{t \ge 0}$ is the $\mathbb S$-valued Kingman coalescent with
pair coalescence rate $\gamma$.

Distance growth. At time $t$, for all $1 \le i < j$ with $i \not\sim_{\varkappa_t} j$, the
genealogical distance $r'_t(i,j)$ grows with constant speed 2.

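To state the duality relation it is necessary to associate a martingale
problem with the tree-valued Kingman coalescent. Consider for $p \in \mathbb S$ the
coalescent operator $\kappa_p : p^2 \to \mathbb S$ such that for $\pi, \pi' \in p$,

(4.3)  $\kappa_p(\pi, \pi') := \big(p \setminus \{\pi, \pi'\}\big) \cup \{\pi \cup \pi'\}$,

i.e., $\kappa_p$ sends two partition elements of the partition $p$ to the new partition
obtained by coalescence of the two partition elements into one.
We consider the space

(4.4)  $\mathcal G := \big\{ G \in \mathcal B(\mathbb K) : G(\cdot, \underline{\underline r}') \in \mathcal C(\mathbb S),\ \forall\, \underline{\underline r}' \in \mathbb R_+^{\binom{\mathbb N}{2}} \big\}$

and the domain

(4.5)  $\mathcal G^{1,0} := \big\{ G \in \mathcal G : \mathrm{div}_p(G) \text{ exists},\ \forall\, p \in \mathbb S \big\}$

with

(4.6)  $\mathrm{div}_p := \sum_{i \not\sim_p j} \frac{\partial}{\partial r'_{i,j}} = 2 \sum_{i < j,\ i \not\sim_p j} \frac{\partial}{\partial r'_{i,j}}$.

We then consider the martingale problem associated with the operator
$\Omega^\downarrow$ on $\mathcal G$ with domain $\mathcal G^{1,0}$, where $\Omega^\downarrow := \Omega^{\downarrow,\mathrm{grow}} + \Omega^{\downarrow,\mathrm{coal}}$, with

(4.7)  $\Omega^{\downarrow,\mathrm{grow}} G(p, \underline{\underline r}') := \mathrm{div}_p(G)(p, \underline{\underline r}')$

and

(4.8)  $\Omega^{\downarrow,\mathrm{coal}} G(p, \underline{\underline r}') := \gamma \sum_{\{\pi,\pi'\} \subseteq p,\ \pi \ne \pi'} \big( G(\kappa_p(\pi,\pi'), \underline{\underline r}') - G(p, \underline{\underline r}') \big)$.

Fix $P_0 \in \mathcal M_1(\mathbb K)$. By construction, the tree-valued Kingman coalescent
solves the $(P_0, \Omega^\downarrow, \mathcal G^{1,0})$-martingale problem.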

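The duality relation. We are ready to state a duality relation between the
tree-valued Fleming-Viot dynamics and the tree-valued Kingman coalescent.

To introduce a class $\mathcal H$ of duality functions, we identify every partition
$p \in \mathbb S$ with the map $p$ which sends $i \in \mathbb N$ to the block $\pi \in p$ iff $i \in \pi$, and
put for $p \in \mathbb S$,

(4.9)  $(\underline{\underline r})^p := \big( r_{\min p(i), \min p(j)} \big)_{1 \le i < j}$.

Let then for each $n \in \mathbb N$ and $\phi \in \mathcal C_b^1(\mathbb R_+^{\binom{\mathbb N}{2}})$ depending on the coordinates
$(r_{i,j})_{1 \le i < j \le n}$ only, the function $H^{n,\phi} : \mathbb U \times \mathbb K \to \mathbb R$ be defined as

(4.10)  $H^{n,\phi}\big(\mathfrak u, (p, \underline{\underline r}')\big) := \int \nu^{\mathfrak u}(d\underline{\underline r})\, \phi\big( (\underline{\underline r})^p + \underline{\underline r}' \big)$.

Notice that then the collection of functions

(4.11)  $\mathcal H = \big\{ H^{n,\phi}(\cdot, \mathfrak k) : n \in \mathbb N,\ \mathfrak k \in \mathbb K,\ \phi \in \mathcal C_b^1(\mathbb R_+^{\binom{\mathbb N}{2}}) \big\}$

is equal to $\Pi^1$, and thus obviously separates points in $\mathcal M_1(\mathbb U)$.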

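Proposition 4.1 (Duality relation). For $P_0 \in \mathcal M_1(\mathbb U)$ and $\mathfrak k \in \mathbb K$, let
$\mathcal U = (\mathcal U_t)_{t \ge 0}$ and $\mathcal K = (\mathcal K_t)_{t \ge 0}$ be solutions of the $(P_0, \Omega^\uparrow, \Pi^1)$- and
$(\delta_{\mathfrak k}, \Omega^\downarrow, \mathcal G^{1,0})$-martingale problem, respectively. Then

(4.12)  $\mathbf E\big[ H(\mathcal U_t, \mathfrak k) \big] = \mathbf E\big[ H(\mathcal U_0, \mathcal K_t) \big]$,

for all $t \ge 0$ and $H \in \mathcal H$.

Proof. We shall establish that for $H^{n,\phi} \in \mathcal H$,

(4.13)  $\Omega^\uparrow H^{n,\phi}(\cdot, \mathfrak k)(\mathfrak u) = \Omega^\downarrow H^{n,\phi}(\mathfrak u, \cdot)(\mathfrak k)$.

Using the fact that $H^{n,\phi}$ is bounded, the assertion then follows from
Theorem 4.4.11 (with $\alpha = \beta = 0$) in [EK86].

We will verify (4.13) for the two components of the dynamics separately.
Observe first that by (2.30) and (4.7),

(4.14)  $\Omega^{\uparrow,\mathrm{grow}} H^{n,\phi}\big(\cdot, (p, \underline{\underline r}')\big)(\mathfrak u)
         = 2 \int \nu^{\mathfrak u}(d\underline{\underline r}) \sum_{1 \le i < j} \frac{\partial}{\partial r_{i,j}} \phi\big( (\underline{\underline r})^p + \underline{\underline r}' \big)
         = \int \nu^{\mathfrak u}(d\underline{\underline r})\, \mathrm{div}_p(\phi)\big( (\underline{\underline r})^p + \underline{\underline r}' \big)$,

where we have used in the second equality that
$\frac{\partial}{\partial r_{i,j}} \phi\big( (\underline{\underline r})^p + \underline{\underline r}' \big) = 0$ whenever $i \sim_p j$.

Similarly, using $\theta_{k,l}$ from (2.32) to express the resampling operator,

(4.15)  $\Omega^{\uparrow,\mathrm{res}} H^{n,\phi}\big(\cdot, (p, \underline{\underline r}')\big)(\mathfrak u)
         = \gamma \sum_{\{\pi,\pi'\} \subseteq p,\ \pi \ne \pi'} \int \nu^{\mathfrak u}(d\underline{\underline r}) \Big( \phi\big( (\underline{\underline r})^{\kappa_p(\pi,\pi')} + \underline{\underline r}' \big) - \phi\big( (\underline{\underline r})^p + \underline{\underline r}' \big) \Big)$.

Combining (4.14) with (4.15) yields (4.13) and thereby completes the
proof. □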



5. Martingale problems for tree-valued Moran dynamics. Fix
$N \in \mathbb N$, and recall from (2.51) in Subsection 2.3 the tree-valued Moran
dynamics $\mathcal U^N = (\mathcal U^N_t)_{t \ge 0}$ of population size $N$. In this section we will
characterize the tree-valued Moran dynamics as unique solutions of a martingale
problem. We will then use an approximation argument to establish the
existence of the solution to the Fleming-Viot martingale problem.

Notice that the states of the tree-valued Moran dynamics with population
size $N$ are restricted to

(5.1)  $\mathbb U_N := \big\{ \mathfrak u = \overline{(U, r, \mu)} \in \mathbb U : N\mu \in \mathcal N(U) \big\} \subset \mathbb U_c$,

where $\mathcal N(U)$ is the set of integer-valued measures on $U$. Moreover, if $\mathfrak u \in \mathbb U_N$,
then $\mathfrak u$ can be represented by the pseudo-metric measure space

(5.2)  $\big( \{1, 2, \ldots, N\}, r', \tfrac1N \sum_{i=1}^N \delta_i \big)$,

for some pseudo-metric $r'$ on $\{1, \ldots, N\}$. In the following we refer to the
elements $i \in \{1, 2, \ldots, N\}$ as the individuals of the population of size $N$.

By construction, the tree-valued Moran dynamics are derived from the
following particle dynamics on the representative (5.2):

Resampling. At rate $\frac\gamma2 > 0$, a resampling event occurs between two
individuals $k, l$ such that distances to $l$ are replaced by distances to $k$.
This implies, in particular, that the genealogical distance between $k$
and $l$ is set to be zero. Equivalently, the measure changes from $\mu$ to
$\mu + \frac1N \delta_k - \frac1N \delta_l$.

Distance growth. The distance between any two different individuals
$i, j$ grows at speed 2.


Example 5.1 (Distance growth for various N). We illustrate the effects
of the distance growth for $N = 2$, $N = 4$ and large $N$. Consider the
ultra-metric space of all leaves in the tree from Figure 2(a). The numbers indicate
weights of atoms in $\mu$. After some small time $\varepsilon$, this ultra-metric space
evolves due to distance growth to different trees, depending on $N$. For $N = 2$,
whenever we sample two different individuals, they must be taken from the
two leaves. Therefore the distance between the two points in the ultra-metric
space grows. For $N = 4$, we may sample two (but not more) individuals
without replacement from the same point in the above tree. We therefore
may sample two individuals from the same leaf, which then splits into two
branches whose lengths grow. For large $N$, we may sample many individuals
from one and the same point, which therefore splits into many branches whose
lengths grow. □



5.1. The martingale problem for a fixed population size N. In this
subsection we characterize the resampling and distance growth dynamics by a
martingale problem.

Fig 2. Tree growth in finite Moran models. (a) The starting tree has only two distinct
points. (b) In a population of size 2 these two points grow in distance, while two or more
individuals can be sampled from the same point for larger N. [Panels in (b): N = 2, N = 4,
N large.]

Fix $N \in \mathbb N$. Similarly as in (2.3), for a metric space $(U, r)$, define a map
which sends a sequence of $N$ points to the matrix of mutual distances

(5.3)  $R^{N,(U,r)} : U^N \to \mathbb R_+^{\binom N2}$,
       $(x_1, \ldots, x_N) \mapsto \big( r(x_i, x_j) \big)_{1 \le i < j \le N}$.

For a metric measure space $(U, r, \mu)$ with $N\mu \in \mathcal N(U)$, let

(5.4)  $\mu^{\otimes \downarrow N}(d(u_1, \ldots, u_N)) := \mu(du_1) \otimes \frac{\mu - \frac1N \delta_{u_1}}{1 - \frac1N}(du_2) \otimes \cdots \otimes \frac{\mu - \frac1N \sum_{k=1}^{N-1} \delta_{u_k}}{1 - \frac{N-1}{N}}(du_N)$

be the sampling (without replacement) measure and define the $N$ distance matrix
distribution (without replacement) $\nu^{N,(U,r,\mu)}$ of $\mathfrak u = \overline{(U, r, \mu)} \in \mathbb U_N$ by

(5.5)  $\nu^{N,(U,r,\mu)} := \big( R^{N,(U,r)} \big)_* \mu^{\otimes \downarrow N} \in \mathcal M_1\big( \mathbb R_+^{\binom N2} \big)$.

Observe that $\mathfrak u \in \mathbb U_N$ is uniquely characterized by its $N$ distance matrix
distribution.

Once more, it is obvious that $\nu^{N,(U,r,\mu)}$ depends on $(U, r, \mu)$ only through
its equivalence class $\overline{(U, r, \mu)} \in \mathbb U_N$, leading to the following definition.

Definition 5.2 (N-distance matrix distribution). For $N \in \mathbb N$, the $N$
distance matrix distribution $\nu^{N,\mathfrak u}$ (without replacement) of $\mathfrak u \in \mathbb U_N$ is defined
as the $N$ distance matrix distribution $\nu^{N,(U,r,\mu)}$ of an arbitrary representative
$(U, r, \mu)$ of the equivalence class $\mathfrak u = \overline{(U, r, \mu)}$.



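For a measurable, bounded $\phi : \mathbb R_+^{\binom N2} \to \mathbb R$, introduce the polynomial
$\Phi = \Phi^{N,\phi}$ by

(5.6)  $\Phi(\mathfrak u) = \langle \nu^{N,\mathfrak u}, \phi \rangle := \int \nu^{N,\mathfrak u}(d\underline{\underline r})\, \phi(\underline{\underline r})$.

Notice that, in contrast to the space of all polynomials, the space of all
polynomials of this special form does not form an algebra. Put therefore

(5.7)  $\Pi_N$ := algebra generated by $\big\{ \Phi^{N,\phi} : \phi \in \mathcal B\big( \mathbb R_+^{\binom N2} \big) \big\}$,

and

(5.8)  $\Pi^1_N$ := algebra generated by $\big\{ \Phi^{N,\phi} : \phi \in \mathcal C_b^1\big( \mathbb R_+^{\binom N2} \big) \big\}$.

We define an operator $\Omega^{\uparrow,N} := \Omega^{\uparrow,\mathrm{grow},N} + \Omega^{\uparrow,\mathrm{res},N}$ on $\Pi_N$ whose action
on $\Pi^1_N$ is given by independent superposition of resampling and distance
growth.

We begin with the distance growth operator $\Omega^{\uparrow,\mathrm{grow},N}$. Let $\underline{\underline 1} \in \mathbb R_+^{\binom N2}$ denote
the matrix with all entries equal to 1, and define

(5.9)  $\sigma_{\underline{\underline r}'}(\underline{\underline r}) := \underline{\underline r} + \underline{\underline r}'$.

In periods of length $\varepsilon > 0$ without resampling the $N$ distance matrix
distribution changes due to tree growth from $\nu^{N,\mathfrak u}$ to $(\sigma_{2\varepsilon \underline{\underline 1}})_* \nu^{N,\mathfrak u}$. Therefore,

(5.10)  $\Omega^{\uparrow,\mathrm{grow},N} \Phi^{N,\phi}(\mathfrak u) := \lim_{\varepsilon \downarrow 0} \frac1\varepsilon \Big( \langle \nu^{N,\mathfrak u}, \phi \circ \sigma_{2\varepsilon \underline{\underline 1}} \rangle - \langle \nu^{N,\mathfrak u}, \phi \rangle \Big)
         = \langle \nu^{N,\mathfrak u}, \mathrm{div}(\phi) \rangle$,

with div from (2.22).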

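For the resampling operator $\Omega^{\uparrow,\mathrm{res},N}$, consider first the action on a
representative $(U, r, \mu)$ of the form (5.2). Any resampling event in which the
individual $l$ is replaced by a copy of the individual $k$ changes the measure
from $\mu$ to $\mu + \frac1N \delta_k - \frac1N \delta_l$.
Therefore, since

(5.11)  $\sum_{1 \le k,l \le N} \big( R^{N,(U,r)} \big)_* \big( \mu + \tfrac1N \delta_k - \tfrac1N \delta_l \big)^{\otimes \downarrow N} = \sum_{1 \le k,l \le N} (\theta_{k,l})_* \nu^{N,(U,r,\mu)}$,

we obtain for $\mathfrak u = \overline{(U, r, \mu)}$ that

(5.12)  $\Omega^{\uparrow,\mathrm{res},N} \Phi^{N,\phi}(\mathfrak u) = \frac\gamma2 \sum_{1 \le k,l \le N} \Big( \langle (\theta_{k,l})_* \nu^{N,\mathfrak u}, \phi \rangle - \langle \nu^{N,\mathfrak u}, \phi \rangle \Big)
         = \frac\gamma2 \sum_{1 \le k,l \le N} \Big( \langle \nu^{N,\mathfrak u}, \phi \circ \theta_{k,l} \rangle - \langle \nu^{N,\mathfrak u}, \phi \rangle \Big)$.

It is easy to see that for given $N \in \mathbb N$, $\Pi^1_N$ separates points in $\mathbb U_N$. We can
therefore use the operator $(\Omega^{\uparrow,N}, \Pi^1_N)$ to characterize the tree-valued Moran
models analytically.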

Proposition 5.3 (Tree-valued Moran dynamics). Fix $N \in \mathbb N$ and let
$P^N_0 \in \mathcal M_1(\mathbb U_N)$. The $(P^N_0, \Omega^{\uparrow,N}, \Pi^1_N)$-martingale problem is well-posed.



Proof. Let $r_0$ in (2.50) be such that the law of $\overline{\big( \{1, 2, \ldots, N\}, r_0, \tfrac1N \sum_{i=1}^N \delta_i \big)}$
equals $P^N_0$. Then the tree-valued Moran dynamics defined in (2.51) solve the
$(P^N_0, \Omega^{\uparrow,N}, \Pi^1_N)$-martingale problem, by construction. This proves existence.

For uniqueness, following the same line of argument as given in Section 8,
one can check that the $(P^N_0, \Omega^{\uparrow,N}, \Pi^1_N)$-martingale problem is dual to the
tree-valued Kingman coalescent, where the duality functions $H \in \Pi^1_N$ are
smooth polynomials that involve sampling without replacement (see, for
example, Corollary 3.7 in [GLW05], where a similar duality is proved on the
level of the measure-valued processes). □



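5.2. Convergence to the Fleming-Viot generator. The goal of this subsection
is to show that the operator for the tree-valued Fleming-Viot martingale
problem is the limit of the operators for the tree-valued Moran martingale
problems. This is one ingredient for the proof of Theorem 2 given in
Section 8.

Proposition 5.4. Let $\Phi \in \Pi^1$. There exist $\Phi_1 \in \Pi^1_1, \Phi_2 \in \Pi^1_2, \ldots$ such
that

(5.13)  $\lim_{N \to \infty} \sup_{\mathfrak u \in \mathbb U_N} |\Phi_N(\mathfrak u) - \Phi(\mathfrak u)| = 0$,

and

(5.14)  $\lim_{N \to \infty} \sup_{\mathfrak u \in \mathbb U_N} |\Omega^{\uparrow,N} \Phi_N(\mathfrak u) - \Omega^\uparrow \Phi(\mathfrak u)| = 0$.



Proof. First, define the extension operator

(5.15)  $\iota_N : \mathbb R_+^{\binom N2} \to \mathbb R_+^{\binom{\mathbb N}{2}}$,
        $(r_{i,j})_{1 \le i < j \le N} \mapsto \big( r_{\tilde i^N \wedge \tilde j^N,\ \tilde i^N \vee \tilde j^N} \big)_{1 \le i < j}$,

where $\tilde i^N := 1 + ((i - 1) \bmod N)$. Fix $\Phi = \Phi^{n,\phi} \in \Pi^1$ for $n \in \mathbb N$,
$\phi \in \mathcal C_b^1(\mathbb R_+^{\binom{\mathbb N}{2}})$. For $N > n$ set $\Phi_N := \Phi^{N,\phi \circ \iota_N} \in \Pi^1_N$. By the definition of the
$N$-distance matrix distribution of a representative (5.5), there is a $C > 0$ such
that

(5.16)  $\sup_{\mathfrak u \in \mathbb U_N} |\Phi_N(\mathfrak u) - \Phi(\mathfrak u)| = \sup_{\mathfrak u \in \mathbb U_N} \big| \langle \nu^{N,\mathfrak u}, \phi \circ \iota_N \rangle - \langle \nu^{\mathfrak u}, \phi \rangle \big|
         = \sup_{\mathfrak u \in \mathbb U_N} \big| \langle (\iota_N)_* \nu^{N,\mathfrak u} - \nu^{\mathfrak u}, \phi \rangle \big| \le \frac{C}{N}$

for all $N > n$. This shows (5.13). For (5.14) observe that $\Omega^\uparrow \Phi(\mathfrak u) = \langle \nu^{\mathfrak u}, \psi \rangle$
and $\Omega^{\uparrow,N} \Phi_N(\mathfrak u) = \langle \nu^{N,\mathfrak u}, \tilde\psi \rangle$ for continuous, bounded functions $\psi$ and $\tilde\psi$
satisfying $\tilde\psi = \psi \circ \iota_N$. Hence, (5.14) follows from (5.16). □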
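
The index wrap-around in the extension operator (5.15) can be made concrete as follows. This is a sketch under assumptions: the finite distance matrix is stored as a dictionary keyed by pairs $(i,j)$ with $1 \le i < j \le N$, the names `wrap` and `extended_entry` are hypothetical, and the entry is taken to be 0 when both indices wrap to the same individual (the diagonal convention).

```python
def wrap(i, n):
    """Map an index i >= 1 to {1, ..., n} via 1 + ((i - 1) mod n)."""
    return 1 + ((i - 1) % n)

def extended_entry(r, n, i, j):
    """Entry (i, j), i < j, of the extension of the n-point distance matrix r.

    Assumption: if i and j wrap to the same individual the distance is 0.
    """
    a, b = wrap(i, n), wrap(j, n)
    if a == b:
        return 0.0
    return r[(min(a, b), max(a, b))]

# Three individuals: 1 and 2 at distance 1, both at distance 4 from individual 3.
r3 = {(1, 2): 1.0, (1, 3): 4.0, (2, 3): 4.0}
```

For instance, with $N = 3$ the entry $(1,4)$ of the extended matrix is 0 because index 4 wraps to individual 1, while the entry $(5,7)$ equals $r_{1,2}$ because 5 and 7 wrap to 2 and 1.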



5.3. Coupling tree-valued Moran dynamics. In this section we show how
to couple two tree-valued Moran dynamics. In particular, using a metric
on ultra-metric measure spaces introduced in [GPW09a], we show that the
coupled processes become closer as time evolves (Proposition 5.8). This will
be an important ingredient in showing the Feller property of the tree-valued
Fleming-Viot dynamics stated in Theorem 1.

We fix $N \in \mathbb N$ and $\mathcal I := \{1, \ldots, N\}$. Informally, we couple two tree-valued
Moran dynamics by using the same resampling events. For this, recall the
Poisson processes $\eta = \{\eta^{i,j};\ i,j \in \mathcal I\}$ from Section 2.3 which determine
resampling events. Recall from (2.49) the notion of ancestors $A_s(i,t)$, $i \in \mathcal I$
and $0 \le s \le t$.

Definition 5.5 (Coupled tree-valued Moran dynamics). Let $r^1_0$ and $r^2_0$
be two pseudo-metrics on $\mathcal I$, $\mathcal U^{N,k}_0 := \mathfrak u^N_k := \overline{\big( \mathcal I, r^k_0, \tfrac1N \sum_{i=1}^N \delta_i \big)}$, $k = 1, 2$, and
$\eta$ a realization of the Poisson processes given in Section 2.3. If we define for
$t > 0$, $\mathcal U^{N,k}_t$, $k = 1,2$, to be given through $r^k_t$, $k = 1,2$, by (2.50) with $r_0$
replaced by $r^k_0$, $k = 1,2$, respectively, then $(\mathcal U^{N,1}_t, \mathcal U^{N,2}_t)_{t \ge 0}$ are referred to as
the coupled tree-valued Moran dynamics started in $(\mathfrak u^N_1, \mathfrak u^N_2)$.

In order to compare $\mathcal U^{N,1}$ and $\mathcal U^{N,2}$ we use the following metric on $\mathbb U$
introduced in [GPW09a, Section 10].

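Definition 5.6 (Modified Eurandom metric). The modified Eurandom
distance between $\mathfrak u_1 = \overline{(U_1, r_1, \mu_1)}$ and $\mathfrak u_2 = \overline{(U_2, r_2, \mu_2)} \in \mathbb U$ is given by

(5.17)  $d'_{\mathrm{Eur}}(\mathfrak u_1, \mathfrak u_2) := \inf_{\tilde\mu} \int_{U_1 \times U_2} \int_{U_1 \times U_2} \tilde\mu(d(i_1, i_2))\, \tilde\mu(d(j_1, j_2))\ |r_1(i_1, j_1) - r_2(i_2, j_2)| \wedge 1$,

where the infimum is taken over all couplings $\tilde\mu$ of $\mu_1$ and $\mu_2$, i.e.,

(5.18)  $\tilde\mu \in \big\{ \tilde\mu' \in \mathcal M_1(U_1 \times U_2) : (\pi_k)_* \tilde\mu' = \mu_k,\ k = 1, 2 \big\}$,

with $\pi_k : U_1 \times U_2 \to U_k$ denoting the projection on the $k$th coordinate,
$k = 1,2$.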
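
For two finite spaces with the same number of points and uniform weights, restricting the infimum to couplings induced by relabelings gives a computable upper bound on the modified Eurandom distance. The sketch below is an illustration, not the definition: general couplings may achieve a smaller value, and the function name `eurandom_upper_bound` is hypothetical.

```python
from itertools import permutations

def eurandom_upper_bound(r1, r2):
    """Upper bound on the modified Eurandom distance between two n-point spaces
    with uniform weights 1/n, using only couplings induced by permutations."""
    n = len(r1)
    best = float("inf")
    for sigma in permutations(range(n)):
        total = sum(
            min(abs(r1[i][j] - r2[sigma[i]][sigma[j]]), 1.0)
            for i in range(n) for j in range(n)
        )
        best = min(best, total / n**2)
    return best
```

If the two distance matrices differ only by a relabeling of the points, the bound is 0, which reflects that the distance is defined on isometry classes.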

Remark 5.7 (Connection to the Gromov-weak topology). As noted in
Proposition 10.5 in [GPW09a], the distance $d'_{\mathrm{Eur}}$ is a metric and metrizes
the Gromov-weak topology (but is not complete). In particular, for random
objects $\mathcal U, \mathcal U_1, \mathcal U_2, \ldots$ in $\mathbb U$ which are all defined on one probability space,
we find that $\mathcal U_n \Rightarrow \mathcal U$ if $\mathbf E[d'_{\mathrm{Eur}}(\mathcal U_n, \mathcal U)] \to 0$, as $n \to \infty$. □

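Proposition 5.8 (Coupled tree-valued Moran dynamics get closer).
Let $(\mathcal U^{N,1}_t, \mathcal U^{N,2}_t)_{t \ge 0}$ be the coupled tree-valued Moran dynamics started in
$(\mathfrak u^N_1, \mathfrak u^N_2)$. Then for all $t \ge 0$,

(5.19)  $d'_{\mathrm{Eur}}\big( \mathcal U^{N,1}_t, \mathcal U^{N,2}_t \big) \le d'_{\mathrm{Eur}}\big( \mathfrak u^N_1, \mathfrak u^N_2 \big)$,

almost surely.

Proof. We use the notation introduced in Definition 5.5. W.l.o.g. we
write $\mathfrak u^N_k := \overline{\big( \mathcal I, r^k_0, \tfrac1N \sum_{i=1}^N \delta_i \big)}$, $k = 1,2$, and have

(5.20)  $d'_{\mathrm{Eur}}\big( \mathfrak u^N_1, \mathfrak u^N_2 \big) = \frac{1}{N^2} \sum_{i,j \in \mathcal I} |r^1_0(i,j) - r^2_0(i,j)| \wedge 1$,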

36 GREVEN, PFAFFELHUBER, WINTER 

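which can be arranged using isometries of $(\mathcal I, r^1_0)$ or $(\mathcal I, r^2_0)$. By the definition of
the coupled tree-valued Moran dynamics (recall also (2.50)),

(5.21)  $|r^1_t(i,j) - r^2_t(i,j)| = \big| r^1_0(A_0(i,t), A_0(j,t)) - r^2_0(A_0(i,t), A_0(j,t)) \big|$.

In particular, for all $t \ge 0$,

(5.22)  $d'_{\mathrm{Eur}}\big( \mathcal U^{N,1}_t, \mathcal U^{N,2}_t \big)
         = \inf_\sigma \frac{1}{N^2} \sum_{i,j \in \mathcal I} |r^1_t(i,j) - r^2_t(\sigma(i), \sigma(j))| \wedge 1$
        $\le \frac{1}{N^2} \sum_{A_0(i,t) \ne A_0(j,t)} \big| r^1_0(A_0(i,t), A_0(j,t)) - r^2_0(A_0(i,t), A_0(j,t)) \big| \wedge 1$
        $\le \frac{1}{N^2} \sum_{i,j \in \mathcal I} |r^1_0(i,j) - r^2_0(i,j)| \wedge 1$
        $= d'_{\mathrm{Eur}}\big( \mathfrak u^N_1, \mathfrak u^N_2 \big)$,

as claimed. □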



6. Limit points are compact. Recall from (2.51) the definition of the tree-valued Moran dynamics $\mathcal U^N$ with population size $N \in \mathbb N$ (compare also Figure 1). In this section we show that potential limit points of the sequence $\{\mathcal U^N; N \in \mathbb N\}$ have cadlag sample paths in $\mathbb U$ and take values in the space of compact metric measure spaces for $t > 0$. In Subsection 6.1 we state a sufficient condition for relative compactness in $\mathbb M_c$. In Subsection 6.2 we apply this criterion to show that the sequence of Moran models $\mathcal U^N$ satisfies the compact containment condition.

6.1. Relative compactness in $\mathbb M_c$. We give a criterion for a set to be relatively compact in $\mathbb M_c$. In this subsection we are dealing with general (not necessarily ultra-) metric measure spaces. We define for $\mathpzc{x} \in \mathbb M$ the distance distribution $w_{\mathpzc{x}} \in \mathcal M_1(\mathbb R_+)$ by

(6.1) $w_{\mathpzc{x}}(A) := \nu^{\mathpzc{x}}\{\underline{\underline{r}} :\ r_{1,2} \in A\},$

for all $A \in \mathcal B(\mathbb R_+)$.

The relative compactness criterion reads as follows: 

Proposition 6.1 (Relative compactness in $\mathbb M_c$). A set $\Gamma \subseteq \mathbb M_c$ is relatively compact in the Gromov-weak topology on $\mathbb M_c$ if the following two conditions are satisfied.



imsart-aop ver. 2008/08/29 file: arXiv3.tex date: July 24, 2009 






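(i) $\{w_{\mathpzc{x}} : \mathpzc{x} \in \Gamma\}$ is tight in $\mathcal M_1(\mathbb R_+)$.

(ii) For all $\varepsilon > 0$ there exists $N_\varepsilon \in \mathbb N$ such that for all $\mathpzc{x} \in \Gamma$ and $(X, r, \mu) \in \mathpzc{x}$, the metric space $(\mathrm{supp}(\mu), r)$ can be covered by $N_\varepsilon$ open balls of radius $\varepsilon$.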



Remark 6.2 (Relative compactness criterion is only sufficient). By Theorem 2 of [GPW09a], (i) is a necessary condition for relative compactness in $\mathbb M$. Note that (ii) is not necessary for relative compactness in $\mathbb M_c$: Consider, for example,

(6.2) T = { Kn = ({0,1, ...,n},r eucl ,Bin(n, ^-)) : n € N} C M c . 

Since $\mathpzc{x}_n \to (\mathbb N, r_{\mathrm{eucl}}, \delta_0)$, as $n \to \infty$, the set in (6.2) is relatively compact, but (ii) does not hold. □

The proof of Proposition 6.1 is based on two lemmata. Recall that for a metric space $(X, r)$ an $\varepsilon$-separated set is a subset $X' \subseteq X$ such that $r(x', y') > \varepsilon$, for all $x', y' \in X'$ with $x' \ne y'$.

Lemma 6.3 (Relation between $\varepsilon$-balls and $\varepsilon$-separated nets). Fix $N \in \mathbb N$, a metric space $(X, r)$ with $\#X \ge N + 1$ and $\varepsilon > 0$. The following hold.

(i) If $(X, r)$ can be covered by $N$ open balls of radius $\varepsilon$, then $(X, r)$ has no $2\varepsilon$-separated sets of cardinality $k \ge N + 1$.

(ii) If $(X, r)$ has no $\varepsilon$-separated set of cardinality $N + 1$, then $(X, r)$ can be covered by $N$ closed balls of radius $\varepsilon$.

Proof. (i) Assume that $x_1, \dots, x_N \in X$ are such that $X = \bigcup_{i=1}^N B_\varepsilon(x_i)$, where we denote by $B_\varepsilon(x)$ the open ball around $x \in X$ of radius $\varepsilon > 0$. Choose $N+1$ distinct points $y_1, \dots, y_{N+1} \in X$. By the pigeonhole principle, two of the points must fall into the same ball $B_\varepsilon(x_i)$, for some $i = 1, \dots, N$, and are therefore at distance smaller than $2\varepsilon$. Hence $\{y_1, \dots, y_{N+1}\}$ is not $2\varepsilon$-separated. Since $y_1, \dots, y_{N+1} \in X$ were chosen arbitrarily, the claim follows.

(ii) We proceed by contradiction. Let $K$ be the maximal possible cardinality of an $\varepsilon$-separated set in $(X, r)$. By assumption, $K \le N$. Assume that $S := \{x_1, \dots, x_K\}$ is an $\varepsilon$-separated set in $(X, r)$. We claim that $X = \bigcup_{i=1}^K \bar B_\varepsilon(x_i)$, where $\bar B_\varepsilon(x)$ denotes the closed ball around $x \in X$ with radius $\varepsilon > 0$. Indeed, assume, to the contrary, that $y \in X$ is such that $r(y, x_i) > \varepsilon$, for all $i = 1, \dots, K$. Then $S \cup \{y\}$ is an $\varepsilon$-separated set of cardinality $K + 1$, which gives the contradiction. □
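The covering argument in part (ii) can be tried out on a finite point set. The Python sketch below builds an $\varepsilon$-separated set greedily and checks that the closed $\varepsilon$-balls around its points cover everything. Note the assumption: the greedy set is only maximal with respect to inclusion, not of maximal cardinality as in the proof, but inclusion-maximality is all the covering step uses. Function names and the sample points are hypothetical.

```python
def greedy_separated_set(points, dist, eps):
    """Pick points one by one, keeping a point only if it is at distance
    strictly larger than eps from all points kept so far."""
    centers = []
    for p in points:
        if all(dist(p, c) > eps for c in centers):
            centers.append(p)
    return centers

def is_covered(points, centers, dist, eps):
    """Check that every point lies in some closed eps-ball around a center."""
    return all(any(dist(p, c) <= eps for c in centers) for p in points)

points = [0.0, 0.3, 0.9, 1.0, 2.5, 2.6, 4.0]
dist = lambda x, y: abs(x - y)
eps = 0.5

centers = greedy_separated_set(points, dist, eps)
print(centers, is_covered(points, centers, dist, eps))
```

Any point left uncovered could be added to the separated set, which is exactly the contradiction used in the proof.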
Lemma 6.4. Fix $\varepsilon > 0$ and $N \in \mathbb N$. Let $\mathpzc{x} = (X, r, \mu)$, $\mathpzc{x}_1 = (X_1, r_1, \mu_1)$, $\mathpzc{x}_2 = (X_2, r_2, \mu_2)$, ... be elements of $\mathbb M$ such that $\mathpzc{x}_n \to \mathpzc{x}$ in the Gromov-weak topology, as $n \to \infty$. If $(\mathrm{supp}(\mu_1), r_1)$, $(\mathrm{supp}(\mu_2), r_2)$, ... can be covered by $N$ open balls of radius $\varepsilon$ then $(\mathrm{supp}(\mu), r)$ can be covered by $N$ closed balls of radius $2\varepsilon$.

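Proof. Define the restriction operator $\rho_N\big((r_{i,j})_{1 \le i < j}\big) := (r_{i,j})_{1 \le i < j \le N}$. By Lemma 6.3(i), there is no $n \in \mathbb N$ for which $(\mathrm{supp}(\mu_n), r_n)$ has a $2\varepsilon$-separated set of cardinality $N + 1$. Set $B_{2\varepsilon} := (2\varepsilon, \infty)^{\binom{N+1}{2}}$. Notice that $\rho_{N+1}^{-1}(B_{2\varepsilon})$ is open. Moreover, $(\mathrm{supp}(\mu), r)$ has a $2\varepsilon$-separated set of cardinality $N + 1$ if and only if $(\rho_{N+1})_* \nu^{\mathpzc{x}}(B_{2\varepsilon}) > 0$. However,

(6.3) $0 \le (\rho_{N+1})_* \nu^{\mathpzc{x}}(B_{2\varepsilon}) \le \liminf_{n \to \infty} (\rho_{N+1})_* \nu^{\mathpzc{x}_n}(B_{2\varepsilon}) = 0$

by Theorem 5(b) in [GPW09a] together with the Portmanteau theorem, therefore $(\rho_{N+1})_* \nu^{\mathpzc{x}}(B_{2\varepsilon}) = 0$. By Lemma 6.3(ii), $(\mathrm{supp}(\mu), r)$ can therefore be covered by $N$ closed balls of radius $2\varepsilon$. □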

Proof of Proposition 6.1. Assume (i) and (ii) hold for a set $\Gamma \subseteq \mathbb M_c$. First note that by Theorem 2 in [GPW09a] the set $\Gamma$ is relatively compact in $\mathbb M$. It remains to show that every limit point of $\Gamma$ is compact. To see this take $\mathpzc{x} \in \mathbb M$ and $\mathpzc{x}_1, \mathpzc{x}_2, \dots \in \Gamma$ such that $\mathpzc{x}_n \to \mathpzc{x}$ in the Gromov-weak topology, as $n \to \infty$, and let $\varepsilon > 0$. By Assumption (ii) together with Lemma 6.4, $(\mathrm{supp}(\mu), r)$ can be covered by $N_{\varepsilon/2}$ closed balls of radius $\varepsilon$. Therefore, $\mathpzc{x}$ is totally bounded which implies $\mathpzc{x} \in \mathbb M_c$, and we are done. □

6.2. The compact containment condition for Moran models. The following result is an important step in the proof of tightness of the family of tree-valued Moran dynamics. Recall the distance distribution $w_u$ from (6.1). The next result states that the family $\{\mathcal U^N; N \in \mathbb N\}$ satisfies the following compact containment conditions.

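Proposition 6.5 (Compact containment). Let for each $N \in \mathbb N$, $\mathcal U^N = (\mathcal U_t^N)_{t \ge 0}$ be the tree-valued Moran dynamics with population size $N$.

(i) Assume that the family $\{w_{\mathcal U_0^N} : N \in \mathbb N\}$ is tight in $\mathcal M_1(\mathbb R_+)$. Then, for all $T > 0$ and all $\varepsilon > 0$, there exists a compact set $\Gamma_{\varepsilon,T} \subseteq \mathbb U_c$ such that

(6.4) $\inf_{N \in \mathbb N} P\{\mathcal U_t^N \in \Gamma_{\varepsilon,T} \text{ for all } t \in [\varepsilon, T]\} > 1 - \varepsilon.$

(ii) Assume that the family $\{\mathcal U_0^N\}_{N = 1,2,\dots}$ is tight in $\mathcal M_1(\mathbb U)$. Then, for all $T > 0$ and all $\varepsilon > 0$, there exists a compact set $\widetilde\Gamma_{\varepsilon,T} \subseteq \mathbb U$ such that

(6.5) $\inf_{N \in \mathbb N} P\{\mathcal U_t^N \in \widetilde\Gamma_{\varepsilon,T} \text{ for all } t \in [0, T]\} > 1 - \varepsilon.$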

The proof relies on the graphical representation of the tree-valued Moran dynamics as illustrated in Figure 1. Recall from (2.49) the ancestor $A_s(i,t)$ back at time $s \le t$ of the individual $i \in \{1, 2, \dots, N\}$ living at time $t$. Denote by

(6.6) $S_\varepsilon^N(t) := \#\{A_{t-\varepsilon}(i,t) :\ i \in \{1, 2, \dots, N\}\},$

the number of ancestors a time $\varepsilon > 0$ back of the population of size $N$ at time $t$.

Lemma 6.6 (Uniform bounds on the number of ancestors).

(i) There exists a constant $C \in (0, \infty)$ such that for all $t > 0$ and $\varepsilon \in$

(o,£/\t), 

(6.7) $\sup_{N \in \mathbb N} P\{S_\varepsilon^N(t) > \varepsilon^{-4/3}\} \le C \varepsilon^2.$

(ii) For $T > 0$, there exists a constant $C = C(T) \in (0, \infty)$ such that for
allee (0,|aT), 

(6.8) $\sup_{N \in \mathbb N} P\{\sup_{t \in [\varepsilon, T]} S_\varepsilon^N(t) > \varepsilon^{-4/3}\} \le C T \varepsilon.$



Proof. (i) W.l.o.g. we assume that $N > \varepsilon^{-4/3}$. Consider the times

(6.9) $T_k^N(t) := t \wedge \inf\{\varepsilon > 0 :\ S_\varepsilon^N(t) \le k\},$

for $k = 1, 2, \dots$. Since any pair of individuals coalesces at rate $\gamma$, $T_k^N$ is stochastically bounded by the sum of $(N - k)$ independent random variables $X_N, \dots, X_{k+1}$, where $X_i$ is exponentially distributed with parameter $\gamma \binom{i}{2}$, for $i = k + 1, \dots, N$. Hence, for $k = 1, 2, \dots$,
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and 

(6.11) Var[rf (t)] < -L ^(2) 

for some $C > 0$ which does not depend on $i$. Here we have used that, for any real-valued random variable $X$ with second moments and all $t \in \mathbb R$,

(6.12)
$\mathrm{Var}[X \wedge t] = \mathrm{Var}[(X - E[X]) \wedge (t - E[X])]$
$\quad \le E[((X - E[X]) \wedge (t - E[X]))^2]$
$\quad \le E[(X - E[X])^2]$
$\quad = \mathrm{Var}[X].$



Thus for all e < ^ A t, 
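(6.13)
$P\{S_\varepsilon^N(t) > \varepsilon^{-4/3}\} = P\{T^N_{\lfloor \varepsilon^{-4/3} \rfloor}(t) > \varepsilon\}$
$\quad \le P\big\{T^N_{\lfloor \varepsilon^{-4/3} \rfloor}(t) - E[T^N_{\lfloor \varepsilon^{-4/3} \rfloor}(t)] > \varepsilon - \tfrac{2}{\gamma} \lfloor \varepsilon^{-4/3} \rfloor^{-1}\big\}$
$\quad \le \frac{\mathrm{Var}[T^N_{\lfloor \varepsilon^{-4/3} \rfloor}(t)]}{\big(\varepsilon - \tfrac{2}{\gamma} \lfloor \varepsilon^{-4/3} \rfloor^{-1}\big)^2}$
$\quad \le C \varepsilon^2,$

for some $C > 0$.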
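The moment bounds behind part (i) can be checked with exact arithmetic. The Python sketch below assumes, as in the proof, independent waiting times $X_i$ that are exponential with parameter $\gamma \binom{i}{2}$, so that $E[X_i] = 2/(\gamma i (i-1))$ and $\mathrm{Var}[X_i] = E[X_i]^2$. It verifies the telescoping identity $\sum_{i=k+1}^N E[X_i] = \tfrac{2}{\gamma}(\tfrac1k - \tfrac1N)$ and checks numerically that the variance sum is of order $k^{-3}$. The function names and the constant 2 in the last check are illustrative choices, not taken from the source.

```python
from fractions import Fraction

def mean_sum(k, N, gamma):
    """Sum of E[X_i] = 2 / (gamma * i * (i - 1)) for i = k+1, ..., N."""
    return sum(Fraction(2) / (gamma * i * (i - 1)) for i in range(k + 1, N + 1))

def variance_sum(k, N, gamma):
    """Sum of Var[X_i] = E[X_i]^2 for i = k+1, ..., N (exponential law)."""
    return sum((Fraction(2) / (gamma * i * (i - 1))) ** 2
               for i in range(k + 1, N + 1))

gamma = Fraction(3, 2)
k, N = 5, 50

# Telescoping: 1/(i(i-1)) = 1/(i-1) - 1/i.
print(mean_sum(k, N, gamma) == (Fraction(2) / gamma) * (Fraction(1, k) - Fraction(1, N)))

# k^3 * gamma^2 * variance stays bounded (here below 2) uniformly in k.
print(max(float(k ** 3 * gamma ** 2 * variance_sum(k, 200, gamma))
          for k in range(1, 21)))
```

With $k = \lfloor \varepsilon^{-4/3} \rfloor$ a variance of order $k^{-3}$ is of order $\varepsilon^4$, which after division by a term of order $\varepsilon^2$ is consistent with the bound $C\varepsilon^2$ in (6.13).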

(ii) Fix TV G N, T > and e G (0, ^ A T), and let t G [e, T]. Observe that 
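for all $s \in [0, t]$ and $\delta > 0$ such that $[s - \delta, s] \subseteq [t - \varepsilon, t]$,

(6.14) $S_\delta^N(s) \ge S_\varepsilon^N(t),$

almost surely. Hence, for all $k \ge 1$,

(6.15) $\sup_{t \in [k\frac{\varepsilon}{2}, (k+1)\frac{\varepsilon}{2}]} S_\varepsilon^N(t) \le S_{\varepsilon/2}^N(k \tfrac{\varepsilon}{2}).$

Thus

(6.16)
$P\{\sup_{t \in [\varepsilon, T]} S_\varepsilon^N(t) > \varepsilon^{-4/3}\} \le P\{\sup_{k = 1, \dots, \lceil 2T/\varepsilon \rceil} S_{\varepsilon/2}^N(k \tfrac{\varepsilon}{2}) > \varepsilon^{-4/3}\}$
$\quad \le \sum_{k=1}^{\lceil 2T/\varepsilon \rceil} P\{S_{\varepsilon/2}^N(k \tfrac{\varepsilon}{2}) > \varepsilon^{-4/3}\}$
$\quad \le 2 C T \varepsilon,$

and (6.8) follows. □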
Proof of Proposition 6.5. In the whole proof we fix $T > 0$ and $\varepsilon > 0$.
W.l.o.g. we take e < jq /\ T and set e k := 2Ur2~ k with C = C(T) from 
Lemma 16. 6f ii). 

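(i) For $u = (U, r, \mu) \in \mathbb U$, denote by

(6.17) $N_\varepsilon(u) := \min\{N' \in \mathbb N :\ (\mathrm{supp}(\mu), r) \text{ can be covered by } N' \text{ balls of radius } \varepsilon\}.$

By assumption, the family $\{w_{\mathcal U_0^N} : N \in \mathbb N\}$ is tight, and we can therefore choose a constant $C_\varepsilon > 0$ such that

(6.18) $w_{\mathcal U_0^N}([C_\varepsilon; \infty)) < \varepsilon,$

for all $N \in \mathbb N$. Put

(6.19) $\Gamma_{\varepsilon,T} := \Gamma^1_{\varepsilon,T} \cap \Gamma^2_{\varepsilon,T},$

where for $i = 1, 2$,

(6.20) $\Gamma^i_{\varepsilon,T} := \bigcap_{k=1}^\infty \Gamma^i_{\varepsilon,T,k}$

with

(6.21) $\Gamma^1_{\varepsilon,T,k} := \{u \in \mathbb U_c :\ w_u([C_{\varepsilon_k} + 2T; \infty)) < \varepsilon_k\},$

and

(6.22) $\Gamma^2_{\varepsilon,T,k} := \{u \in \mathbb U_c :\ N_{2\varepsilon_k}(u) \le \varepsilon_k^{-4/3}\}.$

Then $\Gamma_{\varepsilon,T}$ is relatively compact in $\mathbb U_c$ equipped with the Gromov-weak topology, by Proposition 6.1.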

We next show that (6.4) holds. Notice first that (6.18) implies that

(6.23) $\sup_{N \in \mathbb N,\ 0 \le t \le T} w_{\mathcal U_t^N}([C_\varepsilon + 2T; \infty)) < \varepsilon,$

for all $\varepsilon > 0$, almost surely, since for any pair of two points the distance at any time $t \ge 0$ is not larger than $2t$ plus their initial distance. Hence,

(6.24) $P\{\mathcal U_t^N \in \Gamma^1_{\varepsilon,T} \text{ for all } t \in [0, T]\} = 1.$
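Moreover, we find that for all $N \in \mathbb N$,

(6.25)
$P\{\mathcal U_t^N \in \Gamma^2_{\varepsilon,T,k} \text{ for all } t \in [\varepsilon_k, T]\}$
$\quad = 1 - P\Big(\bigcup_{t \in [\varepsilon_k, T]} \{N_{2\varepsilon_k}(\mathcal U_t^N) > \varepsilon_k^{-4/3}\}\Big)$
$\quad = 1 - P\{\sup_{t \in [\varepsilon_k, T]} S_{\varepsilon_k}^N(t) > \varepsilon_k^{-4/3}\}$
$\quad \ge 1 - C T \varepsilon_k$

by Lemma 6.6(ii). Hence

(6.26)
$P\{\mathcal U_t^N \in \Gamma^2_{\varepsilon,T} \text{ for all } t \in [\varepsilon, T]\}$
$\quad \ge 1 - \sum_{k=1}^\infty P\{\mathcal U_t^N \notin \Gamma^2_{\varepsilon,T,k} \text{ for at least one } t \in [\varepsilon_k, T]\}$
$\quad \ge 1 - \tfrac{\varepsilon}{2},$

and we are done.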



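(ii) The family $\{\mathcal U_0^N := (U_0^N, r_0^N, \mu_0^N);\ N \in \mathbb N\}$ is tight by assumption. That is, for $\varepsilon > 0$ there is a compact set $\Gamma_\varepsilon \subseteq \mathbb U$ such that $\inf_{N \in \mathbb N} P\{\mathcal U_0^N \in \Gamma_\varepsilon\} > 1 - \varepsilon$.

By the characterization of compactness given in Part (c) of Theorem 7.1 in [GPW09a] a subset $\Gamma \subseteq \mathbb U$ is pre-compact if and only if for all $\eta > 0$ there exists a number $N_\eta \in \mathbb N$ such that for all $(U, r, \mu) \in u \in \Gamma$ there is a subset $U_\eta \subseteq U$ with

• $\mu(U_\eta) > 1 - \eta$,

• $U_\eta$ can be covered by at most $N_\eta$ balls of radius $\eta$, and

• $U_\eta$ has diameter at most $N_\eta$.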

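Hence, if $\varepsilon, \eta > 0$ are fixed, we can find a number $N_{\varepsilon,\eta}$ such that for all $N \in \mathbb N$ the following event occurs with probability at least $1 - \varepsilon$: if $(U_0^N, r_0^N, \mu_0^N) \in \mathcal U_0^N$ there is $V_{\varepsilon,\eta} = \{u_1, \dots, u_{N_{\varepsilon,\eta}}\} \subseteq U_0^N$ such that $\mu_0^N(B_\eta(V_{\varepsilon,\eta})) > 1 - \eta$ where

(6.27) $B_\eta(V) := \bigcup_{x \in V} B_\eta(x)$

for $V \subseteq U_0^N$. For $k \in \mathbb N$ we set $N_k := N_{\varepsilon_k, \varepsilon_k^2}$ and for $N \in \mathbb N$ we take $V_{N,k} \subseteq U_0^N$ with $\#V_{N,k} = N_k$ such that

(6.28) $P\{\mu_0^N(B_{\varepsilon_k^2}(V_{N,k})) > 1 - \varepsilon_k^2\} > 1 - \varepsilon_k.$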
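We consider the set

(6.29) $\widetilde\Gamma_{\varepsilon,T} := \Gamma^1_{\varepsilon,T} \cap (\Gamma^2_{\varepsilon,T} \cup \Gamma^3_\varepsilon)$

with $\Gamma^1_{\varepsilon,T}$ and $\Gamma^2_{\varepsilon,T}$ from (6.19), and let

(6.30) $\Gamma^3_\varepsilon := \bigcap_{k=1}^\infty \Gamma^3_{\varepsilon,k},$

and

(6.31) $\Gamma^3_{\varepsilon,k} := \{(U, r, \mu) \in \mathbb U :\ \exists V \subseteq U,\ \#V = N_k,\ \mu(B_{2\varepsilon_k}(V)) > 1 - \varepsilon_k\}.$

The set $\widetilde\Gamma_{\varepsilon,T}$ is compact in $\mathbb U$ as a finite union of compact sets. Furthermore, recalling the notion of an ancestor $A_0(t, u)$ of $u$ living at time $t$ back at time $0$ from (2.49),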
(6.32) 

liminfPJWf € r^ rfc for all t G [0;e fc ]} 

AT— »oo 

>l-limsupP{ inf sup ^ (B 2Ek (V)) < 1 - s k \ 

N^oc ^telO,Sh]vCUN,\V\=N k 

> 1 -limsupP( inf M f {B £k {{u : A (u,t) G B £fe (Fjv )fc )}) < 1 - e k \ 

> 1 - limsupPf sup -J-#{u G E/f : A (M) g B Eh (V N , k )} > e k \ 

>1-P{ sup Zf* >e fe } 
te[o )6fc ] 

>l-e k - 1 P[Zll]=l-e k 

with $Z^z$ denoting the Wright-Fisher diffusion starting in $z \in (0, 1)$, i.e., the unique strong solution of $Z_t^z = z + \int_0^t \sqrt{\gamma Z_s^z (1 - Z_s^z)}\, dB_s$. In the above calculation we made use of the following: in the second inequality, that at most $N_k$ balls of radius $2\varepsilon_k$ in $U_t^N$, $0 \le t \le \varepsilon_k$, cover the descendants of $B_{\varepsilon_k^2}(V_{N,k})$; for the fourth inequality, that the fraction of individuals at time $t$ whose ancestors back at time $0$ do not live in the $\varepsilon_k^2$-neighborhood of $V_{N,k}$ evolves as the fraction of one of the two alleles of a two-allele Moran model, which is known to converge weakly as a process towards the Wright-Fisher diffusion which starts in $\varepsilon_k^2$. In the fifth inequality we used Doob's martingale inequality and the Chebyshev inequality.

Combining ([OKI) with flOU yields that 
(6.33) 

oo 

lim inf P{U t N G Tl T U T eT for all £ G [0; T]} > 1 - £ 2e k > 1 - ±- 
which together with (6.24) gives the claim.



□ 



7. Limit points have continuous paths. It is well-known that the measure-valued Fleming-Viot process has continuous paths (e.g., [Daw93]). In this section we show that the same is true for the tree-valued Fleming-Viot dynamics by controlling the jump sizes in the approximating sequence of Moran models.

Recall from (2.51) in Subsection 2.3 the definition of a Moran model $\mathcal U^N$ of population size $N \in \mathbb N$.

Proposition 7.1 (Limit points have continuous paths). If $\mathcal U^N \Rightarrow \mathcal U$, as $N \to \infty$, for some process $\mathcal U$ with sample paths in the Skorohod space, $\mathcal D_{\mathbb U}([0, \infty))$, of cadlag functions from $[0, \infty)$ to $\mathbb U$, then $\mathcal U \in \mathcal C_{\mathbb U}([0, \infty))$, almost surely.

Remark 7.2 (The Gromov-Prohorov metric). In the proof we need a metric generating the Gromov-weak topology on $\mathbb M$. We use the Gromov-Prohorov metric $d_{\mathrm{GPr}}$ defined as in [GPW09a]: For $\mathpzc{x}, \mathpzc{y} \in \mathbb M$ with $(X, r_X, \mu_X) \in \mathpzc{x}$ and $(Y, r_Y, \mu_Y) \in \mathpzc{y}$, put

(7.1) $d_{\mathrm{GPr}}(\mathpzc{x}, \mathpzc{y}) := \inf_{(\varphi_X, \varphi_Y, Z)} d_{\mathrm{Pr}}^{(Z, r_Z)}\big((\varphi_X)_* \mu_X, (\varphi_Y)_* \mu_Y\big),$

where the infimum is taken over isometric embeddings $\varphi_X$ and $\varphi_Y$ from $\mathrm{supp}(\mu_X)$ and $\mathrm{supp}(\mu_Y)$, respectively, into some common metric space $(Z, r_Z)$, and the Prohorov metric on $(Z, r_Z)$ is given by

(7.2) $d_{\mathrm{Pr}}^{(Z, r_Z)}(\nu, \nu') = \inf\{\varepsilon > 0 :\ \nu(A) \le \nu'(B_\varepsilon(A)) + \varepsilon, \text{ for all closed } A \subseteq Z\},$

where $B_\varepsilon(A)$ is defined as in (6.27). As shown in Theorem 5 of [GPW09a], the Gromov-Prohorov metric generates the Gromov-weak topology. □



Proof of Proposition 7.1. Recall from Section 2.3 the construction of the tree-valued Moran dynamics $\mathcal U^N$ based on Poisson point processes $\{\eta^{i,j};\ 1 \le i, j \le N\}$ (compare also Figure 1). Note that the tree-valued Moran dynamics have paths in $\mathcal D_{\mathbb U_c}(\mathbb R_+)$, almost surely.

If $\eta^{i,j}\{t\} = 0$ for all $1 \le i \ne j \le N$, then $\mathcal U_{t-}^N = \mathcal U_t^N$. Otherwise, if $\eta^{i,j}\{t\} = 1$, for some $1 \le i \ne j \le N$, then

d GPl (U t N _,U t N ) 



N N 

= dGPr(({l, 2, N}, rl ,1^4), ({1, 2, N}, rl±Yl *k) 

k=l k=l 

(7.3) N N 

< 4ia...^) ( 1 s fc, ^ _ + 1 s 4 ) 

fc=i fc=i 

2 

and therefore 

(7.4) / dTe" T sup d GPl (U t N „,U t N ) < - 
Jo te[o,T] iV 

for all $T > 0$ and almost all sample paths $\mathcal U^N$. Hence the assertion follows by Theorem 3.10.2 in [EK86]. □



8. Proofs of the main results (Theorems 1, 2, 3). In this section we give the proof of the main results stated in Section 2. Theorems 1 and 2 are proved simultaneously.

Proof of Theorems 1 and 2. Fix $P_0 \in \mathcal M_1(\mathbb U)$. Recall, for each $N \in \mathbb N$, the state-space $\mathbb U_N$, and the $\mathbb U_N$-valued Moran dynamics, $\mathcal U^N = (\mathcal U_t^N)_{t \ge 0}$, from (5.1) and (2.51), respectively. Furthermore, let for each $N \in \mathbb N$, $P_0^N \in \mathcal M_1(\mathbb U_N)$ be given such that $P_0^N \Rightarrow P_0$, as $N \to \infty$.

By Proposition 5.3 the $(P_0^N, \Omega^{\uparrow,N}, \Pi^1_N)$-martingale problem is well-posed, and is solved by $\mathcal U^N$. Proposition 5.4 implies with a standard argument (see, for example, Lemma 4.5.1 in [EK86]) that if $\mathcal U^N \Rightarrow \mathcal U$, for some $\mathcal U \in \mathcal D_{\mathbb U}([0, \infty))$, as $N \to \infty$, then $\mathcal U$ solves the $(P_0, \Omega^\uparrow, \Pi^1)$-martingale problem. Hence for existence we need to show that the sequence $\{\mathcal U^N;\ N \in \mathbb N\}$ is tight, or equivalently by Remark 2.9 combined with Remark 4.5.2 in [EK86] that the compact containment condition in $\mathbb U$ holds. However, the latter was proved in Proposition 6.5(ii).

By standard theory (see, for example, Theorem 4.4.2 in [EK86]), uniqueness of the $(P_0, \Omega^\uparrow, \Pi^1)$-martingale problem follows from uniqueness of the one-dimensional distributions of solutions of the $(P_0, \Omega^\uparrow, \Pi^1)$-martingale problem. The latter can be verified using the duality of the tree-valued Fleming-Viot dynamics to the tree-valued Kingman coalescent, $\mathcal K := (\mathcal K_t)_{t \ge 0}$, as defined in (4.2). That is, if $\mathcal U = (\mathcal U_t)_{t \ge 0}$ is a solution of the $(P_0, \Omega^\uparrow, \Pi^1)$-martingale problem, then (4.12) holds for all $\kappa \in \mathbb K$, $t \ge 0$ and $H \in \mathcal H$. Since $\mathcal H$ is separating in $\mathcal M_1(\mathbb U)$ by Proposition 4.1(i), uniqueness of the one-dimensional distributions follows.

So far we have shown that the $(P_0, \Omega^\uparrow, \Pi^1)$-martingale problem is well-posed and its solution arises as the weak limit of the solutions of the $(P_0^N, \Omega^{\uparrow,N}, \Pi^1_N)$-martingale problems. In particular, the tree-valued Moran dynamics converge to the tree-valued Fleming-Viot dynamics. Hence we have shown Theorem 1 and Theorem 2. □



Proof of Proposition 2.15. (i), (ii) The tree-valued Fleming-Viot dynamics is the weak limit of tree-valued Moran dynamics. Hence, Propositions 6.5(i) and 7.1 imply that the tree-valued Fleming-Viot dynamics have values in the space of compact ultra-metric measure spaces for each $t > 0$ and have continuous paths, respectively, almost surely.

(iii) Recall the functions $g^n(t, \sigma)$ from (3.13). Note that for $1 \le i < j$, by exchangeability,

(8.1) $E\big[\nu^{\mathcal U_t}\{\underline{\underline r} : r_{i,j} = 0\}\big] = \lim_{\sigma \to \infty} E\Big[\int \nu^{\mathcal U_t}(d\underline{\underline r})\, e^{-\sigma r_{1,2}}\Big] = \lim_{\sigma \to \infty} g^2(t, \sigma) = 0,$

where we have used explicit calculations for the mean sample Laplace transform which we presented in more generality in Corollary 3.4. By dominated and monotone convergence,

(8.2) $E\Big[\int_0^\infty dt\, \nu^{\mathcal U_t}\big(\complement (0, \infty)^{\binom{\mathbb N}{2}}\big)\Big] \le \sum_{1 \le i < j} \lim_{T \to \infty} \int_0^T dt\, E\big[\nu^{\mathcal U_t}\{\underline{\underline r} : r_{i,j} = 0\}\big] = 0,$

and hence, almost surely, $\nu^{\mathcal U_t}\big((0, \infty)^{\binom{\mathbb N}{2}}\big) = 1$ for Lebesgue almost all $t \in \mathbb R_+$. □



Proof of Proposition 2.16. Note that the strong Markov property follows from the Feller property by general theory, [EK86, Theorem 4.4.2]. Let $\mathcal U^u = (\mathcal U_t^u)_{t \ge 0}$ be the solution of the $(\delta_u, \Omega^\uparrow, \Pi^1)$-martingale problem, i.e. the tree-valued Fleming-Viot dynamics, started in $\mathcal U_0 = u$. For the Feller property, it suffices to show that $u_n \to u$ implies that $\mathcal U_t^{u_n} \Rightarrow \mathcal U_t^u$ for all $u \in \mathbb U$ and $t \ge 0$. Recall the coupled tree-valued Moran dynamics from Section 5.3. For $u, u' \in \mathbb U$, take $u_N, u'_N \in \mathbb U_N$ with $u_N \to u$ and $u'_N \to u'$ in the Gromov-weak topology. Let $(\mathcal U_t^{N,1}, \mathcal U_t^{N,2})_{t \ge 0}$ be the coupled tree-valued Moran dynamics, started in $(u_N, u'_N)$. As in the proof of Theorems 1 and 2, the family of coupled tree-valued Moran dynamics $\{(\mathcal U_t^{N,1}, \mathcal U_t^{N,2})_{t \ge 0};\ N \in \mathbb N\}$ is tight. Let $(\mathcal U_t^u, \mathcal U_t^{u'})_{t \ge 0}$ be a weak limit point which must be a coupling of tree-valued Fleming-Viot dynamics by construction. Moreover, since the modified Eurandom metric (see Definition 5.6) is continuous in the Gromov-weak topology and bounded,

(8.3)
$E[d'_{\mathrm{Eur}}(\mathcal U_t^u, \mathcal U_t^{u'})] = \lim_{N \to \infty} E[d'_{\mathrm{Eur}}(\mathcal U_t^{N,1}, \mathcal U_t^{N,2})]$
$\quad \le \lim_{N \to \infty} d'_{\mathrm{Eur}}(u_N, u'_N)$

by Proposition 5.8. In particular, $u'_n \to u$, as $n \to \infty$, implies that $E[d'_{\mathrm{Eur}}(\mathcal U_t^u, \mathcal U_t^{u'_n})] \to 0$, as $n \to \infty$, which in turn implies $\mathcal U_t^{u'_n} \Rightarrow \mathcal U_t^u$, as $n \to \infty$, by Remark 5.7. □



Proof of Corollary 12.181 For $ = as in the Corollary, observe 
that (u u ,(j)) 2 = (u 11 , (<p,(j)) n } with (0, </>) n from (12.27|) . Therefore, given = 
u, we compute 
(8.4) 

d($(W))t = ft T 3> 2 («) - 2$(u)ft T $(») 

= (i/",div((<M) n ) -2(0,div(0))„) 

n 

fc,«=l 
n 

fc,/=l 

and the result follows from the first two terms vanishing and 

n n 

(8.5) k,l=i k,l=l 

= n 2 (v u , (4>, 4>) n o 9 x , n +i) 

with the symmetrization </> introduced in Remark I2.8f ii) and $ n >^ = $ n >^. 

□ 
Proof of Theorem 3. In order to prove Theorem 3 we need two ingredients:

• The family $\{\mathcal U_t;\ t > 1\}$ is tight.

• $E^{\delta_u}[\Phi(\mathcal U_t)] \to E[\Phi(\mathcal U_\infty)]$, as $t \to \infty$, for all $\Phi \in \Pi^1$ and $u \in \mathbb U$.

Then, Theorem 3 follows from Lemma 3.4.3 of [EK86].

We show tightness of {Ut; t > 1} in U using Theorem 3 and (3.3) of 
|GP W09a] . First, recalling (|6.1|) and when E[u^/ f ] is the first moment mea- 
sure of wu t € Mi(Mi(R + )), for t and C > 0, 



(8.6) EK4]((C;oo)) 



f e-^E[wuMC-t;oo)), C>t 
e~~t c , C <t. 



So, for given e > 0, choose C > large enough such that 
E[u> Wo ]((C;oo)) < e and e~^ c < e. Then, E[w Wt ]((2C; oo)) < e for alH > 
and so, {E[u;^ t ],t > 1} is tight. 

Secondly, for Ut = (Ut,rt, fMt), we have to show that for < e < 1 there is 
5 > with 

(8.7) sup E[m{u : n(B e (u)) < 5}} < e. 
t>i 

Note that the probability on the left hand side does not depend on t. 
Using that Uoo is determined by A = 7 • <5 in (4.7) of |GPW09a] we find 

(8.8) lim E[fx t {u : (i(B e (x)) < 5}} = lim : fi(B e (x)) < 5}} = 

8 — *-0 — >0 

by (4.9) and (4.11) of [GPW09a| . So, tightness follows. 

The assertion (ii) is an application of the duality relation from Proposition 

KT\ Fix 4> e Cl(R^'). We apply the duality relation between the 

tree- valued Fleming- Viot dynamics and the tree-valued Kingman coalescent 
which starts in £0 = {po,r! ) with := n G N} and rg_' = 0. Moreover 

we know from the construction of K that E &t o [4>(r_ t )] — > E[{iM x , (p)] and 
Vt {^}, as t — ► co where is the (rate 7) Kingman measure tree from 
(j235j> . Hence, by (f4TT2"D . 



lim E 5 " [(z/ w <, 0)1 = lim E 5 *o [ / ^"(dr) (f>((r) Vt + r/) 



(8.9) 



lim E 5 *o U(r')l 

£ — KX) — * 

E[(i^-,0)]. 



Since € (M+ ) was chosen arbitrarily, (ii) follows and we are done. □ 
9. Proof of the applications (Proof of Theorems 4 and 5). In this section we prove the results stated in Section 3.

Proof of Lemma 3.1. Consider the traveling salesperson problem for a salesperson who must visit all leaves $x_1, \dots, x_n$ of the tree and who starts at one leaf to which she comes back at the end of the trip. It is easy to see that such a path must pass all edges in both directions, so the length of the path is at least twice the tree length. It is also easy to see that taking an optimal path and leaving out leaf $x_i$ gives an optimal path for the remaining leaves $x_1, \dots, x_{i-1}, x_{i+1}, \dots, x_n$.

We claim that there is one path connecting the set of leaves such that each edge in the tree is passed exactly twice, which is equivalent to the assertion of the Lemma. Assume to the contrary that such an order does not exist. We take a path of minimal length. There must be one edge which is visited at least four times. W.l.o.g. we assume that this edge is internal, i.e. not adjacent to a leaf. So there are four points $x_i, x_j, x_k, x_l \in X$, visited in the order $x_i, x_j, x_k, x_l, x_i$, such that $[x_i; x_j] \cap [x_k; x_l]$ is visited at least four times, where $[x; y]$ is the path from $x$ to $y$ in $X$. Since leaving out leaves gives again an optimal path, leaving out all leaves except $x_i, x_j, x_k, x_l$ must lead to an optimal path connecting these four points. However, this optimal path must be $x_i, x_j, x_l, x_k, x_i$ (or its reverse), since this path passes all edges only twice. Hence, we have a contradiction and the assertion is proved. □
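The relation between tree length and closed tours can be checked by brute force on a small example. The Python sketch below reads the assertion of the Lemma as "tree length equals half the length of the shortest closed tour through all leaves", which is our reading of the proof above. The tree length is computed independently by adding leaves one at a time; the increment formula used there (half the distance to the nearest earlier leaf, plus the extra root edge when the new leaf lies beyond the current depth) is an assumption valid for ultra-metric trees, and all names are hypothetical.

```python
from itertools import permutations

def tree_length(r):
    """Length of the tree spanned by points 0..n-1 of an ultra-metric r,
    built up by adding one leaf at a time."""
    length, depth = 0.0, 0.0
    for m in range(1, len(r)):
        nearest = min(r[k][m] for k in range(m))
        # new leaf edge, plus a new root edge if the leaf is beyond the depth
        length += nearest / 2 + max(nearest / 2 - depth, 0.0)
        depth = max(depth, nearest / 2)
    return length

def shortest_closed_tour(r):
    """Brute-force minimum over all closed tours starting and ending at 0."""
    best = float("inf")
    for order in permutations(range(1, len(r))):
        tour = (0,) + order + (0,)
        best = min(best, sum(r[a][b] for a, b in zip(tour, tour[1:])))
    return best

# Two cherries {0,1} and {2,3} at height 1, joined at height 3.
r = [[0, 2, 6, 6],
     [2, 0, 6, 6],
     [6, 6, 0, 2],
     [6, 6, 2, 0]]
print(tree_length(r), shortest_closed_tour(r) / 2)
```

In this example the tour 0, 1, 2, 3, 0 passes every edge exactly twice, while the tour 0, 2, 1, 3, 0 crosses the internal edges four times and is strictly longer.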

Proof of Theorem 4. We first show injectivity of $\xi$. Assume we are given a compact ultra-metric measure space $(U_0, r_0, \mu_0)$ and its equivalence class $u_0 = \overline{(U_0, r_0, \mu_0)}$. We show that if

(9.1) $\Lambda := \xi(u_0),$

then

(9.2) $\xi^{-1}(\{\Lambda\}) = \{u_0\}.$

We do this by explicitly reconstructing $u_0$ from $\Lambda$.

We proceed in three steps. In the first two steps we consider the case where $\mu_0$ is supported by finitely many atoms. In Step 1 we follow an argument provided to us by Steve Evans which explains how to recover the isometry class of $(\mathrm{supp}(\mu_0), r_0)$ from $\Lambda$. In Step 2 we then recover the measure $\mu_0$. Finally, the case of a general element in $\mathbb U$ is obtained by approximation via finite ultra-metric measure spaces in Step 3.
Step 1 (Evans's reconstruction procedure for finite trees). Assume that $u \in \xi^{-1}(\{\Lambda\})$ and that $u = (U, r, \mu)$ with $\#\mathrm{supp}(\mu) < \infty$. Put

(9.3) $A_N := \{(\ell_1 := 0, \ell_2, \dots) :\ \ell_k > \ell_{k-1} \text{ for exactly } N - 1 \text{ different } k\}.$

First observe that $\#\mathrm{supp}(\mu) = N$ if and only if $\Lambda$ is supported on $A_N$. That is, we can recover $\#\mathrm{supp}(\mu)$ from $\Lambda$. So, assume that $\mu$ has $N$ atoms and w.l.o.g. $U := \{1, \dots, N\}$. We will now recover $r = (r_{i,j})_{1 \le i < j \le N}$ from $\Lambda$.

For that purpose, introduce on $\mathbb R_+^{\mathbb N}$ the lexicographic ordering $\prec$, i.e., $\underline\ell \prec \underline\ell'$ iff for $k^* := \min\{k : \ell_k \ne \ell'_k\}$ we have $\ell_{k^*} < \ell'_{k^*}$. Let

(9.4) $B := \{\underline\ell \in \mathrm{supp}(\Lambda) :\ \ell_1 < \dots < \ell_N\}$

be the space of all vectors $\underline\ell$ which are accessible by sequentially sampling the $N$ different points of $U$ and evaluating subsequently the lengths of the sub-trees spanned by them. Moreover, let

(9.5) $\underline\ell^* := \min{}_{\prec} B,$

i.e., $\underline\ell^* := (\ell_k^*)_{k \in \mathbb N}$ is the minimal element in $B$ with respect to the order relation $\prec$.

W.l.o.g. we assume that the points in $U$ are enumerated in such a way that for all $n \in \{1, \dots, N\}$,

(9.6) $\ell_n^* := L_n^{(U,r)}(\{1, \dots, n\}).$

Notice that if $d_n^*$ denotes the depth of the sub-tree spanned by $\{1, \dots, n\}$, i.e., $d_n^* := \tfrac12 \max\{r(i,j);\ 1 \le i, j \le n\}$, for $n \in \mathbb N$, then $d_1^* = 0$ and the recursion

(9.7) $d_n^* = \tfrac12 (\ell_n^* + d_{n-1}^* - \ell_{n-1}^*) \vee d_{n-1}^*$

holds for $n \ge 2$.
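The depth recursion can be checked on a small example. The Python sketch below takes the recursion in the form $d_n^* = \tfrac12(\ell_n^* - \ell_{n-1}^* + d_{n-1}^*) \vee d_{n-1}^*$, which is our reading of (9.7), and compares it with the direct definition $d_n^* = \tfrac12 \max_{i,j \le n} r(i,j)$. The tree lengths $(0, 2, 7, 8)$ belong to a four-leaf tree with two cherries at height 1 joined at height 3; the example and the function names are hypothetical.

```python
def depths_from_lengths(lengths):
    """Depths d_1, d_2, ... obtained from the sub-tree lengths l_1, l_2, ...
    via d_n = max(d_{n-1}, (l_n - l_{n-1} + d_{n-1}) / 2), with d_1 = 0."""
    depths = [0.0]
    for n in range(1, len(lengths)):
        increment = lengths[n] - lengths[n - 1]
        depths.append(max(depths[-1], (increment + depths[-1]) / 2))
    return depths

def depths_direct(r):
    """Depths as half the diameter of the first n points, n = 1, 2, ..."""
    return [max(r[i][j] for i in range(n + 1) for j in range(n + 1)) / 2
            for n in range(len(r))]

r = [[0, 2, 6, 6],
     [2, 0, 6, 6],
     [6, 6, 0, 2],
     [6, 6, 2, 0]]
lengths = [0, 2, 7, 8]   # lengths of the trees spanned by the first n leaves

print(depths_from_lengths(lengths), depths_direct(r))
```

The third leaf lies beyond the current depth, so the depth jumps from 1 to 3; the fourth leaf attaches inside the existing tree and leaves the depth unchanged.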

We claim that we can even recover $(r_{i,j})_{1 \le i < j \le N}$ from $(\ell_n^*)_{n = 1, \dots, N}$. In fact, for all $n \in \mathbb N$,

(9.8) $r_{n-1,n} = \min_{1 \le k \le n-1} r_{k,n}.$

To see this, assume to the contrary that there is a minimal n G N for which 
we find a k < n — 1 such that rfc jn is minimal and rk, n < r n-i,n- Choose 
the minimal I with k < I < n — 1 and r£ n < r; n . Then, sampling the 
/ points 1, 2, k, I — l,n (in that order) leads to the sequence of tree 
lengths £t,£* 2 ,...,£l 

-i^i-i + \ r k,n- However, by the minimality of I we have 
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that rfc >n > ri-i tn and by the ultra-metric property r k ^ n < r/ in V ri-i >n = 
rj_ij. Hence, the above tree lengths are smaller (with respect to -<) than 
since ^* > + ^H-iji- So, assuming that (19. 8h does not 
hold contradicts the assumption that I* is minimal. 

However, from (|9.8|) we conclude the following recursion: for all n € 
{2, ...,N} and 1 < k < n- 1, 

(9.9) r fc , n = r fe , n _ x V 2(£ - C-i " K " 

The latter together with the necessary requirements that $r_{n,n} := 0$ and $r_{1,2} := \ell_2^*$ determines the metric on $U$ uniquely.

Step 2 (Reconstruction of weights in finite trees). In this step we will reconstruct weights $(p_1, \dots, p_N)$ on $(\{1, \dots, N\}, r)$ from the given $\Lambda$. Denote by $\Gamma \subseteq \Sigma_N$ the set of permutations of $\{1, \dots, N\}$ for which the metric $r$ given in Step 1 satisfies $r_{i,j} = r_{\sigma(i), \sigma(j)}$, for all $1 \le i, j \le N$. Since we are interested in measure-preserving isometry classes only, we need to show that $(p_1, \dots, p_N)$ are uniquely determined up to permutations $\sigma \in \Gamma$.

For all $\underline k = (k_1, \dots, k_{N-1}, k_N) \in \{0, 1, \dots\}^{N-1} \times \{\infty\}$, define

(9.10) $\underline\ell^{\underline k} := \big(\ell_1^* = 0, \underbrace{\ell_1^*, \dots, \ell_1^*}_{k_1\text{ times}}, \ell_2^*, \underbrace{\ell_2^*, \dots, \ell_2^*}_{k_2\text{ times}}, \ell_3^*, \underbrace{\ell_3^*, \dots, \ell_3^*}_{k_3\text{ times}}, \dots\big)$

where $\underline\ell^*$ is the minimal subtree length vector in the support of $\Lambda$ from Step 1. Observe that sampling from the subtree length distribution first the point 1 a number of $k_1 + 1$ times, then the point 2, then one of the points in $\{1, 2\}$ a number of $k_2$ times, and so on, results exactly in the vector $\underline\ell^{\underline k}$. Hence, taking all possible permutations $\sigma \in \Gamma$ into account,

N-l , 

WHIC^-EIKE^)' 

(9.1D tr ^ 

=A({fi)-E n ( e p*v)T- 

crgr 1=1 l<j<i 

We claim that (9.11) determines $(p_1, \dots, p_N)$ uniquely up to permutations $\sigma \in \Gamma$.

To see this, observe that the algebra of functions on the $(N-1)$-dimensional simplex $S_N$, generated by the functions

(9.12) $\Big\{ f((p_1, \dots, p_N)) := \prod_{i=1}^{N-1} \Big( \sum_{1 \le j \le i} p_j \Big)^{k_i};\ k_1, \dots, k_{N-1} \in \mathbb N \Big\}$

separates points. Hence, every $f \in \mathcal C(S_N)$ can be approximated uniformly by functions in this algebra by the Stone-Weierstrass Theorem. Hence, by knowing $\Lambda(\{\underline\ell^{\underline k}\}) / \Lambda(\{\underline\ell^*\})$ for all $\underline k$, using (9.11), we also know the values of

(9.13) $\sum_{\sigma \in \Gamma} f((p_{\sigma(1)}, \dots, p_{\sigma(N)}))$

by an approximation argument.

In particular, we can find the set $A := \{(p_{\sigma(1)}, \dots, p_{\sigma(N)}) :\ \sigma \in \Gamma\}$. By setting $\mu\{i\} = p_i$ for an arbitrary $(p_1, \dots, p_N) \in A$ we have recovered $\mu$ uniquely up to isometries such that $\xi^{-1}(\{\Lambda\}) = \{u\}$ by construction.

Step 3 (General ultra-metric measure spaces). Let $u = (U, r, \mu) \in \xi^{-1}(\{\Lambda\})$ be not necessarily finite anymore. We shall approximate $u$ by finite ultra-metric measure spaces which we then treat as described in the first two steps.

For that purpose, define for all $\varepsilon > 0$ the ($\varepsilon$-shrunken) pseudo-metric $r^\varepsilon$ on $U$ by putting

(9.14) $r^\varepsilon := 0 \vee (r - \varepsilon).$

Notice that since $(\mathrm{supp}(\mu), r)$ is ultra-pseudo-metric, $(\mathrm{supp}(\mu), r^\varepsilon)$ is ultra-pseudo-metric as well, for all $\varepsilon$.

Moreover, for all $\varepsilon > 0$ there is a covering of $U$ by disjoint (closed) balls $B_1, B_2, \dots \subseteq U$ of radius $\varepsilon$ with $\mu(B_1) \ge \mu(B_2) \ge \dots$. Take $N_\varepsilon$ large enough such that for $B^\varepsilon = \bigcup_{i=1}^{N_\varepsilon} B_i$ we have $\mu(B^\varepsilon) > 1 - \varepsilon$ and $\mu(B_{N_\varepsilon}) > \mu(B_{N_\varepsilon + 1})$. Set $\mu^\varepsilon(\cdot) := \mu(\cdot \mid B^\varepsilon)$ and

(9.15) $u^\varepsilon := (U, r^\varepsilon, \mu^\varepsilon).$

Then $u^\varepsilon$ is a finite metric measure space and $u^\varepsilon \to u$ in the Gromov-weak topology, as $\varepsilon \to 0$.
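That the shrinking map $r \mapsto 0 \vee (r - \varepsilon)$ of (9.14) preserves the ultra-metric inequality follows because $x \mapsto 0 \vee (x - \varepsilon)$ is non-decreasing and therefore commutes with maxima. The Python sketch below checks this on a small example; the distance matrix and function names are hypothetical. Note that points at distance at most $\varepsilon$ are glued together, which is why $r^\varepsilon$ is only a pseudo-metric.

```python
def shrink(r, eps):
    """Apply r -> max(0, r - eps) entrywise to a distance matrix."""
    return [[max(0.0, x - eps) for x in row] for row in r]

def is_ultrametric(r, tol=1e-12):
    """Check r(i,j) <= max(r(i,k), r(k,j)) for all triples (i, j, k)."""
    n = len(r)
    return all(r[i][j] <= max(r[i][k], r[k][j]) + tol
               for i in range(n) for j in range(n) for k in range(n))

r = [[0, 2, 6, 6],
     [2, 0, 6, 6],
     [6, 6, 0, 2],
     [6, 6, 2, 0]]

r_eps = shrink(r, 2.5)
print(is_ultrametric(r), is_ultrametric(r_eps), r_eps[0][1], r_eps[0][2])
```

With $\varepsilon = 2.5$ the two cherries collapse to single points (distance 0), so the shrunken space has only two distinguishable points at distance 3.5.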

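Given $u_1, u_2, \dots \in U$, set $\ell_n := L_n^{(U,r)}(\{u_1, \dots, u_n\})$ leading to the subtree length vector $(\ell_1, \ell_2, \dots) \in \mathbb R_+^{\mathbb N}$. We define the map $\ell^\varepsilon : \mathbb R_+^{\mathbb N} \to \mathbb R_+^{\mathbb N}$ given by

(9.16) $\ell^\varepsilon : (\ell_1, \ell_2, \dots) \mapsto (\ell_1^\varepsilon, \ell_2^\varepsilon, \dots)$

with $\ell_1^\varepsilon = 0$ and $\ell_2^\varepsilon = 0 \vee (\ell_2 - \varepsilon)$ and for $n \ge 3$, recursively,

(9.17) $\ell_n^\varepsilon := \ell_{n-1}^\varepsilon + (\ell_n - \ell_{n-1} - \tfrac12 \varepsilon)^+.$
Moreover, set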

(9.18) 

A £)n := {(£1 = 0,^2,—) : 4 > 4-1 for exactly 2V e - 1 different i G {1, 2, n}}, 
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and we observe that 

(9.19) £("')(•) = (£)*v uS = ■ \As,n)- 

Now, take u € C~ 1 ({^})- Observe that Z £ — > u and u e — > « in the Gromov- 
weak topology, as e > 0. Hence we are in a position to apply Steps 1 and 2 
to find that Z e E ^ _1 (lim ri ^ 00 ^A(-|yl ein )) = {u 6 }, for all e > 0. This shows 
that u = lim e ^o " e = h m e^o " e = »• 

As for continuity of £, assume that (ufc)fceN is a sequence in U such that 
ufc — > u, for some u £ U, in the Gromov-weak topology, as fc — > oo. Then 
by definition, 3>(t/fc) — > "&(«), for all <3? € II , as k — > oo. In particular, 
since the map r i— > ^ n (r) is continuous as it is the minimum of finitely 
many continuous functions, for all n € N, (£(ufc),V') ~~ ► {^(« ) , '0) ; f° r an 
V 7 G or equivalently, C("fc) £(") m the weak topology on A4i(M+), 

as k — ► oo. □ 



Proof of Theorem [5j (i) Since £ is bijective on £(U), it is a con- 
sequence of Theorem 3.2 in |Kur98j that the martingale problem for 
(£*P(U), fiT'", n 1 '^) is well-posed. Moreover, by construction, (£(Ut))t>o 
solves the martingale problem. In addition, since IA has the Feller prop- 
erty and £ is continuous, 3 is Feller, too. The last assertion follows from the 
continuity of the sample paths of the tree-valued Fleming- Viot dynamics 
and the continuity of £. 

(ii) With I from ([3~j]) . 

^(^o£)(«) 

= (i/*,div(V> o|) +7 ^ (u u ,ip ° i° 6 k ,i - ip ° £) 

l<k<l 

(9.20) = £ ^« ^ £)) + 7 £ (n - 1) <«/^ o / Vl o £ - ^ o |) 

n>2 ° tn n>2 

d 

= n{£(u), "(f («0>^ °Pn-ip) 

n>2 ° ln n>l 

and we are done. □ 

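To prepare the proof of Corollary 3.4 we investigate for each time $t \ge 0$ the mean sample Laplace transform,

(9.21) $g(t; \underline\sigma) := E[\Psi^{\underline\sigma}(\Xi_t)],$

of the subtree lengths distribution $\Xi_t$, where for $\underline\sigma \in \mathbb R_+^{\mathbb N}$,

(9.22) $\Psi^{\underline\sigma}(\Xi) := \int \Xi(d\underline\ell)\, \psi^{\underline\sigma}(\underline\ell)$

with the test function

(9.23) $\psi^{\underline\sigma}(\underline\ell) := \exp(-\langle \underline\sigma, \underline\ell \rangle).$

As usual, $\langle \cdot, \cdot \rangle$ denotes the inner product.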

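Lemma 9.1 (ODE system for the mean sample Laplace transforms). For $\underline\sigma \in \mathbb R_+^{\mathbb N}$ having only finitely many non-zero entries, the functions $g(\cdot; \underline\sigma)$ satisfy the following system of differential equations:

(9.24) $\frac{d}{dt} g(t; \underline\sigma) = -\Big(\sum_{k=2}^\infty k \sigma_k\Big) g(t; \underline\sigma) + \gamma \sum_{k=1}^\infty k \big(g(t; \tau_k \underline\sigma) - g(t; \underline\sigma)\big)$

with the merging operator

(9.25) $\tau_k : (\sigma_1, \dots, \sigma_{k-1}, \sigma_k, \sigma_{k+1}, \sigma_{k+2}, \dots) \mapsto (\sigma_1, \dots, \sigma_{k-1}, \sigma_k + \sigma_{k+1}, \sigma_{k+2}, \dots).$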

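Proof. By standard arguments, $\Psi^{\underline{\sigma}} \in \Pi^{1,\Xi}$ and

\[
\frac{d}{dt} g(t; \underline{\sigma}) = \mathbf{E}\big[\Omega^{\uparrow,\Xi} \Psi^{\underline{\sigma}}(\Xi_t)\big].
\tag{9.26}
\]

Hence, inserting the definition of $\Omega^{\uparrow,\Xi}$ and using $\psi^{\underline{\sigma}}(\beta_k \underline{\ell}) = \psi^{\tau_k \underline{\sigma}}(\underline{\ell})$ for all $k = 1, 2, \ldots$,
with $\beta_k$ from (9.20) and $\tau_k$ from (9.25), we find

\[
\begin{aligned}
\frac{d}{dt} g(t; \underline{\sigma})
&= \mathbf{E}\Big[-\sum_{k=2}^{\infty} k \sigma_k \Psi^{\underline{\sigma}}(\Xi_t) + \gamma \sum_{k=1}^{\infty} k \big(\Psi^{\tau_k \underline{\sigma}}(\Xi_t) - \Psi^{\underline{\sigma}}(\Xi_t)\big)\Big] \\
&= -\Big(\sum_{k=2}^{\infty} k \sigma_k\Big) g(t; \underline{\sigma}) + \gamma \sum_{k=1}^{\infty} k \big(g(t; \tau_k \underline{\sigma}) - g(t; \underline{\sigma})\big),
\end{aligned}
\tag{9.27}
\]

as claimed. □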
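The action of the merging operator in (9.25) on finitely supported vectors can be sketched in a few lines of Python. The function names `merge` and `support` and the chosen values of `n` and `s` are illustrative assumptions, not notation from the paper. The sketch counts for which `k` the operator moves a single non-zero entry from position `n` to position `n - 1`, which is the bookkeeping behind the factor $\binom{n}{2}$ that appears when (9.24) is specialised to one coordinate.

```python
from typing import List


def merge(sigma: List[float], k: int) -> List[float]:
    """Merging operator tau_k of (9.25), 1-based: add entries k and k+1, shift the rest left."""
    # Pad with zeros so that entries k and k+1 exist (sigma is finitely supported).
    padded = list(sigma) + [0.0] * max(0, k + 1 - len(sigma))
    return padded[: k - 1] + [padded[k - 1] + padded[k]] + padded[k + 1:]


def support(sigma: List[float]) -> List[int]:
    """1-based positions of the non-zero entries."""
    return [i + 1 for i, value in enumerate(sigma) if value != 0.0]


# Hypothetical example: sigma has the single non-zero entry s at position n.
n, s = 5, 0.7
sigma = [0.0] * (n - 1) + [s]

# Values of k for which tau_k moves the mass from position n to position n - 1.
moved = [k for k in range(1, n + 3) if support(merge(sigma, k)) == [n - 1]]
weight = sum(moved)  # total rate weight sum_k k over those k
```

Here `moved` consists of k = 1, ..., n - 1, so `weight` equals n(n - 1)/2; for k ≥ n the vector keeps its entry at position n and the corresponding summand in (9.24) vanishes.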
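Remark 9.2. Recall, for each $n \in \mathbb{N}$, the function $g^n$ from (3.13). For
each $n \ge 2$ and $\sigma > 0$, applying (9.24) to $\underline{\sigma} = (\sigma \delta_{k,n})_{k \ge 2}$ yields, setting
$g^1(t; \sigma) := 1$,

\[
\begin{aligned}
\frac{d}{dt} g^n(t; \sigma)
&= -n \sigma\, g^n(t; \sigma) + \gamma \binom{n}{2} \big(g^{n-1}(t; \sigma) - g^n(t; \sigma)\big) \\
&= \frac{\gamma}{2} n (n-1)\, g^{n-1}(t; \sigma) - \frac{\gamma}{2} n \Big(\frac{2\sigma}{\gamma} + n - 1\Big) g^n(t; \sigma),
\end{aligned}
\tag{9.28}
\]

i.e., with $\tilde{\sigma} = 2\sigma/\gamma$,

\[
\frac{d}{dt} \big(g^2(t; \sigma), g^3(t; \sigma), \ldots\big)^{\top}
= \frac{\gamma}{2} \Big( A(\tilde{\sigma}) \big(g^2(t; \sigma), g^3(t; \sigma), \ldots\big)^{\top} + b^{\top} \Big),
\tag{9.29}
\]

where

\[
b^{\top} := (2, 0, \ldots)^{\top}
\tag{9.30}
\]

and for $\tilde{\sigma} > 0$ the matrix $A := A(\tilde{\sigma})$ is defined by

\[
A_{k,l} :=
\begin{cases}
k(k-1), & \text{if } k = l + 1, \\
-k(\tilde{\sigma} + k - 1), & \text{if } k = l, \\
0, & \text{else},
\end{cases}
\tag{9.31}
\]

for all $k, l \ge 2$. □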
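As a numerical illustration of the system in Remark 9.2, the following Python sketch integrates $\frac{d}{dt} g^n = -n\sigma g^n + \gamma \binom{n}{2}(g^{n-1} - g^n)$, with $g^1 := 1$, by a classical Runge-Kutta scheme and compares the long-time values with $\Gamma(n)\Gamma(\tilde\sigma + 1)/\Gamma(n + \tilde\sigma)$ for $\tilde\sigma = 2\sigma/\gamma$. The parameter values, the truncation level `N`, the step size and the initial condition $g^n(0; \sigma) = 1$ (an initial tree with zero subtree lengths) are assumptions made for the example only.

```python
import math


def rhs(g, sigma, gamma):
    """Right-hand side of the ODE system for g = [g^2, ..., g^N], with g^1 := 1."""
    out = []
    for i, gn in enumerate(g):
        n = i + 2
        prev = 1.0 if n == 2 else g[i - 1]
        out.append(-n * sigma * gn + gamma * math.comb(n, 2) * (prev - gn))
    return out


def rk4_step(g, dt, sigma, gamma):
    """One classical fourth-order Runge-Kutta step of size dt."""
    k1 = rhs(g, sigma, gamma)
    k2 = rhs([x + 0.5 * dt * k for x, k in zip(g, k1)], sigma, gamma)
    k3 = rhs([x + 0.5 * dt * k for x, k in zip(g, k2)], sigma, gamma)
    k4 = rhs([x + dt * k for x, k in zip(g, k3)], sigma, gamma)
    return [x + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for x, a, b, c, d in zip(g, k1, k2, k3, k4)]


def limit(n, sigma, gamma):
    """Candidate long-time limit Gamma(n) Gamma(s + 1) / Gamma(n + s), s = 2 sigma / gamma."""
    s = 2.0 * sigma / gamma
    return math.gamma(n) * math.gamma(s + 1.0) / math.gamma(n + s)


# Hypothetical parameters for the illustration.
sigma, gamma, N = 1.0, 1.0, 6
g = [1.0] * (N - 1)          # assumed initial condition g^n(0; sigma) = 1
for _ in range(2000):        # integrate up to time T = 20 with dt = 0.01
    g = rk4_step(g, 0.01, sigma, gamma)

errors = [abs(g[n - 2] - limit(n, sigma, gamma)) for n in range(2, N + 1)]
```

Because the system is lower triangular, truncating at level `N` does not affect the components $g^2, \ldots, g^N$; the entries of `errors` should be at the level of floating-point round-off.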



The proof of Corollary 3.4 uses the following preparatory lemma.

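Lemma 9.3. Fix $\tilde{\sigma} > 0$. Let $B = (B_{k,l})_{k,l \ge 2}$ and $B^{-1} = (B^{-1}_{k,l})_{k,l \ge 2}$ be
matrices defined by

\[
B_{k,l} := \frac{k!}{l!} \binom{k-1}{l-1} \frac{\Gamma(\tilde{\sigma} + 2l)}{\Gamma(\tilde{\sigma} + k + l)},
\quad\text{and}\quad
B^{-1}_{k,l} := (-1)^{k+l} \frac{k!}{l!} \binom{k-1}{l-1} \frac{\Gamma(\tilde{\sigma} + k + l - 1)}{\Gamma(\tilde{\sigma} + 2k - 1)}.
\tag{9.32}
\]

(i) The matrices $B$ and $B^{-1}$ are inverse to each other.
(ii) The matrix $A = A(\tilde{\sigma}) = (A_{k,l})_{k,l \ge 2}$ has eigenvalues

\[
\lambda_k := -k(\tilde{\sigma} + k - 1), \qquad k \ge 2.
\tag{9.33}
\]

(iii) If $D = (\lambda_k \delta_{k,l})_{k,l \ge 2}$ then

\[
f(A) = B f(D) B^{-1}
\tag{9.34}
\]

for all analytic functions $f : \mathbb{R}^{\mathbb{N}^2} \to \mathbb{R}^{\mathbb{N}^2}$. Specifically, $A^{-1} = B D^{-1} B^{-1}$
and $e^{At} = B e^{Dt} B^{-1}$ for all $t \ge 0$.

(iv) For $\tilde{\sigma} > 0$, let $A^{-1}(\tilde{\sigma}) = (A^{-1}_{k,l})_{k,l \ge 2}$ be given by $A^{-1}_{k,l} = 0$ for $k < l$ and

\[
A^{-1}_{k,l} := -\frac{(k-1)!\, \Gamma(\tilde{\sigma} + l - 1)}{l!\, \Gamma(\tilde{\sigma} + k)}, \qquad k \ge l.
\tag{9.35}
\]

Then $A^{-1}$ and $A$ are inverse to each other.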



Proof. First, we note that $A$, $A^{-1}$, $B$, $B^{-1}$ are lower triangular infinite
matrices. This implies that the domain of the maps induced by these
matrices is $\mathbb{R}^{\mathbb{N}}$. In particular, we do not have to distinguish between left
and right inverse matrices of $A$ and $B$.

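(i) We need to show that

\[
(B \cdot B^{-1})_{k,l} = \delta_{k,l}
\tag{9.36}
\]

for $k, l \ge 2$. This is clear in the case where $k \le l$. For $k > l \ge 2$, with
constants $C$ changing from line to line, and using the abbreviations $\hat{k} := k - l$
and $\hat{\sigma} := \tilde{\sigma} + 2l - 1$,

\[
\begin{aligned}
(B \cdot B^{-1})_{k,l}
&= \sum_{m=l}^{k} B_{k,m} B^{-1}_{m,l} \\
&= \sum_{m=l}^{k} \frac{k!}{m!} \binom{k-1}{m-1} \frac{\Gamma(\tilde{\sigma} + 2m)}{\Gamma(\tilde{\sigma} + k + m)} \cdot (-1)^{m+l} \frac{m!}{l!} \binom{m-1}{l-1} \frac{\Gamma(\tilde{\sigma} + m + l - 1)}{\Gamma(\tilde{\sigma} + 2m - 1)} \\
&= C \sum_{m=l}^{k} (-1)^{m+l} \frac{(\tilde{\sigma} + 2m - 1)\, \Gamma(\tilde{\sigma} + m + l - 1)}{(k-m)!\, (m-l)!\, \Gamma(\tilde{\sigma} + k + m)} \\
&= C \sum_{m=0}^{\hat{k}} (-1)^{m} \frac{(\hat{\sigma} + 2m)\, \Gamma(\hat{\sigma} + m)}{\Gamma(\hat{k} - m + 1)\, \Gamma(m + 1)\, \Gamma(\hat{\sigma} + \hat{k} + m + 1)} \\
&= C \sum_{m=0}^{\hat{k}} (-1)^{m} \frac{(\hat{\sigma} + 2m)\, \Gamma(\hat{\sigma} + m)}{\Gamma(m + 1)} \cdot \frac{\Gamma(\hat{\sigma} + 2\hat{k} + 1)}{\Gamma(\hat{\sigma} + \hat{k} + m + 1)\, \Gamma(\hat{k} - m + 1)} \\
&= 0,
\end{aligned}
\tag{9.37}
\]

where we have used that

\[
\frac{(\hat{\sigma} + 2m)\, \Gamma(\hat{\sigma} + m)}{\Gamma(m + 1)\, \Gamma(\hat{\sigma} + 1)}
= \frac{\Gamma(\hat{\sigma} + m + 1)}{\Gamma(m + 1)\, \Gamma(\hat{\sigma} + 1)} + \frac{\Gamma(\hat{\sigma} + m)}{\Gamma(m)\, \Gamma(\hat{\sigma} + 1)}
\tag{9.38}
\]

and then applied Formula (5d) on page 10 in [Rio68].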
(ii) Since $A$ is lower triangular, its eigenvalues are its diagonal entries, so this is obvious.

(iii) Note that

\[
(A \cdot B)_{2,l} - \lambda_l B_{2,l} = 0
\tag{9.39}
\]

and

\[
\lambda_l - \lambda_k = \tilde{\sigma}(k - l) + (k^2 - k - l^2 + l) = (k - l)(\tilde{\sigma} + k + l - 1).
\tag{9.40}
\]

Thus for all $k \ge 3$ and $2 \le l < k$,

\[
B_{k,l} = \frac{k(k-1)}{(k-l)(\tilde{\sigma} + k + l - 1)}\, B_{k-1,l},
\tag{9.41}
\]

and since $A_{k,k} = \lambda_k$,

\[
\begin{aligned}
(A \cdot B)_{k,l} - \lambda_l B_{k,l}
&= A_{k,k-1} B_{k-1,l} + (\lambda_k - \lambda_l) B_{k,l} \\
&= \big(k(k-1) - k(k-1)\big) B_{k-1,l} \\
&= 0,
\end{aligned}
\tag{9.42}
\]

which proves that the columns of $B$ are eigenvectors of $A$. Hence the claim follows
by standard linear algebra.

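(iv) It is clear that $(A \cdot A^{-1})_{k,k} = 1$ and that $(A \cdot A^{-1})_{k,l} = 0$ for $k < l$, while for $k > l$,

\[
\begin{aligned}
(A \cdot A^{-1})_{k,l}
&= A_{k,k-1} \cdot A^{-1}_{k-1,l} + A_{k,k} \cdot A^{-1}_{k,l} \\
&= -k(k-1)\, \frac{(k-2)!\, \Gamma(\tilde{\sigma} + l - 1)}{l!\, \Gamma(\tilde{\sigma} + k - 1)} + k(\tilde{\sigma} + k - 1)\, \frac{(k-1)!\, \Gamma(\tilde{\sigma} + l - 1)}{l!\, \Gamma(\tilde{\sigma} + k)} \\
&= 0.
\end{aligned}
\tag{9.43}
\]

□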
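The identities of Lemma 9.3 can be checked numerically on finite truncations, since products of lower triangular matrices restricted to indices $2, \ldots, N$ coincide with the restriction of the infinite products. The Python sketch below assumes the entries $B_{k,l} = \frac{k!}{l!}\binom{k-1}{l-1}\Gamma(\tilde\sigma+2l)/\Gamma(\tilde\sigma+k+l)$, $B^{-1}_{k,l} = (-1)^{k+l}\frac{k!}{l!}\binom{k-1}{l-1}\Gamma(\tilde\sigma+k+l-1)/\Gamma(\tilde\sigma+2k-1)$ and $A^{-1}_{k,l} = -(k-1)!\,\Gamma(\tilde\sigma+l-1)/(l!\,\Gamma(\tilde\sigma+k))$ for $k \ge l$; the helper names, the truncation level `N` and the value of `s` (standing for $\tilde\sigma$) are illustrative choices.

```python
import math


def A_entry(k, l, s):
    """Entries of the bidiagonal matrix A(s): k(k-1) below the diagonal, -k(s+k-1) on it."""
    if k == l + 1:
        return float(k * (k - 1))
    if k == l:
        return -k * (s + k - 1.0)
    return 0.0


def A_inv_entry(k, l, s):
    """Assumed entries of the inverse of A(s); zero above the diagonal."""
    if k < l:
        return 0.0
    return -math.factorial(k - 1) * math.gamma(s + l - 1) / (math.factorial(l) * math.gamma(s + k))


def B_entry(k, l, s):
    """Assumed entries of the eigenvector matrix B."""
    if k < l:
        return 0.0
    prefactor = math.factorial(k) / math.factorial(l) * math.comb(k - 1, l - 1)
    return prefactor * math.gamma(s + 2 * l) / math.gamma(s + k + l)


def B_inv_entry(k, l, s):
    """Assumed entries of the inverse of B."""
    if k < l:
        return 0.0
    prefactor = math.factorial(k) / math.factorial(l) * math.comb(k - 1, l - 1)
    return (-1) ** (k + l) * prefactor * math.gamma(s + k + l - 1) / math.gamma(s + 2 * k - 1)


def build(entry, N, s):
    """Truncation of an infinite matrix to the index range 2..N."""
    return [[entry(k, l, s) for l in range(2, N + 1)] for k in range(2, N + 1)]


def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def max_abs_diff(X, Y):
    return max(abs(x - y) for rx, ry in zip(X, Y) for x, y in zip(rx, ry))


N, s = 7, 1.5  # hypothetical truncation level and value of sigma-tilde
A, A_inv, B, B_inv = (build(f, N, s) for f in (A_entry, A_inv_entry, B_entry, B_inv_entry))
identity = [[float(i == j) for j in range(N - 1)] for i in range(N - 1)]
D = [[A[i][j] if i == j else 0.0 for j in range(N - 1)] for i in range(N - 1)]

err_B = max_abs_diff(matmul(B, B_inv), identity)       # part (i)
err_eig = max_abs_diff(matmul(A, B), matmul(B, D))     # part (iii): A B = B D
err_A = max_abs_diff(matmul(A, A_inv), identity)       # part (iv)
```

All three errors should be of the order of floating-point round-off. Such a check does not replace the proof, but it guards against sign or index slips in the closed-form entries.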



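Proof of Corollary 3.4. Fix $n \in \mathbb{N}$ and $\sigma > 0$. Put

\[
h^{\sigma,n}(t) := g^n\big(\tfrac{2t}{\gamma}; \sigma\big).
\tag{9.44}
\]

By (9.28), the vector $h := (h^{\sigma,2}, h^{\sigma,3}, \ldots)^{\top}$ satisfies the linear system of
ordinary differential equations

\[
\frac{d}{dt} h(t) = A h(t) + b,
\tag{9.45}
\]

or equivalently,

\[
h(t) = e^{At} h(0) + e^{At} A^{-1} b - A^{-1} b,
\tag{9.46}
\]

with $b = (2, 0, 0, \ldots)^{\top}$ and $A = (A_{k,l})_{k,l \ge 2}$ as defined in (9.31). Consequently,
if $B$, $B^{-1}$ and $D$ are as in Lemma 9.3 then

\[
h(t) = -A^{-1} b + B e^{Dt} \big(B^{-1} h(0) + D^{-1} B^{-1} b\big).
\tag{9.47}
\]

Combining (9.44) with (9.47) yields the explicit expressions given in (3.14).
Finally, by (9.47), since all eigenvalues $\lambda_k$ are negative,

\[
\begin{aligned}
g^n(t; \sigma) \xrightarrow[t \to \infty]{} -2 \big(A(\tilde{\sigma})^{-1}\big)_{n,2}
&= \frac{\Gamma(n)\, \Gamma(\tilde{\sigma} + 1)}{\Gamma(n + \tilde{\sigma})} \\
&= \mathbf{E}\Big[e^{-\sigma \sum_{k=2}^{n} \mathcal{E}^k}\Big].
\end{aligned}
\tag{9.48}
\]

□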
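The solution formula $h(t) = -A^{-1}b + Be^{Dt}(B^{-1}h(0) + D^{-1}B^{-1}b)$ can be evaluated directly on a truncation to indices $2, \ldots, N$. The Python sketch below does this using $A^{-1} = BD^{-1}B^{-1}$, so only $B$, $B^{-1}$ and the eigenvalues $\lambda_k = -k(\tilde\sigma + k - 1)$ are needed. The closed-form entries of $B$ and $B^{-1}$, the function names, the value of `s` (standing for $\tilde\sigma$) and the initial vector `h0` are assumptions made for the illustration. The second component-free check uses the scalar equation $\frac{d}{dt}h^{2} = -2(\tilde\sigma+1)h^{2} + 2$, which can be solved by hand.

```python
import math


def lam(k, s):
    """Eigenvalue lambda_k = -k (s + k - 1)."""
    return -k * (s + k - 1.0)


def B_entry(k, l, s):
    """Assumed closed form of the eigenvector matrix B (lower triangular)."""
    if k < l:
        return 0.0
    prefactor = math.factorial(k) / math.factorial(l) * math.comb(k - 1, l - 1)
    return prefactor * math.gamma(s + 2 * l) / math.gamma(s + k + l)


def B_inv_entry(k, l, s):
    """Assumed closed form of the inverse of B (lower triangular)."""
    if k < l:
        return 0.0
    prefactor = math.factorial(k) / math.factorial(l) * math.comb(k - 1, l - 1)
    return (-1) ** (k + l) * prefactor * math.gamma(s + k + l - 1) / math.gamma(s + 2 * k - 1)


def h_of_t(t, h0, s):
    """Evaluate h(t) = B [ e^{Dt} (B^{-1} h0 + D^{-1} B^{-1} b) - D^{-1} B^{-1} b ], b = (2, 0, ...)."""
    idx = range(2, len(h0) + 2)
    b = [2.0] + [0.0] * (len(h0) - 1)
    u = [sum(B_inv_entry(k, l, s) * h0[l - 2] for l in idx) for k in idx]  # B^{-1} h(0)
    w = [sum(B_inv_entry(k, l, s) * b[l - 2] for l in idx) for k in idx]   # B^{-1} b
    c = [(u[k - 2] + w[k - 2] / lam(k, s)) * math.exp(lam(k, s) * t) - w[k - 2] / lam(k, s)
         for k in idx]
    return [sum(B_entry(k, l, s) * c[l - 2] for l in idx) for k in idx]


def stationary(n, s):
    """Long-time limit Gamma(n) Gamma(s + 1) / Gamma(n + s)."""
    return math.gamma(n) * math.gamma(s + 1.0) / math.gamma(n + s)


s = 2.0                 # hypothetical value of sigma-tilde
h0 = [1.0] * 5          # assumed initial values h^2(0), ..., h^6(0)
h_start = h_of_t(0.0, h0, s)
h_mid = h_of_t(0.3, h0, s)
h_late = h_of_t(50.0, h0, s)

# Hand-solved scalar equation for the first component.
closed_form_n2 = 1.0 / (s + 1.0) + (h0[0] - 1.0 / (s + 1.0)) * math.exp(-2.0 * (s + 1.0) * 0.3)
```

At `t = 0` the formula should return `h0`, the first component of `h_mid` should agree with `closed_form_n2`, and `h_late` should be close to the values of `stationary(n, s)` for n = 2, ..., 6.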